ExfDataScribe
Release Date: 5 Jan, 2025
Release Version: v1.0
ExfDataScribe is an intelligent metadata management tool that leverages Large Language Models to automate the generation of table and column descriptions for databases. It supports seamless connectivity to databases, enabling users to select schemas and tables for profiling and description generation. The application intelligently identifies Personally Identifiable Information (PII) and allows users to edit and customize the metadata. Designed as a cloud-agnostic, no-code/low-code tool, ExfDataScribe is ideal for regulatory documentation and metadata standardization, with export options in Excel, Word, or PDF formats.
Key Features
  • Seamless Database Connectivity: Connects to Snowflake accounts for database and schema selection.
  • Data Profiling Support: Optionally profiles columns to extract insights (e.g., min, max, average, distinct values) to enhance descriptions.
  • Customizable Metadata: Allows users to refine generated descriptions and PII tags for accuracy and context.
  • Automated Metadata Generation: Leverages LLMs to generate high-quality descriptions for tables and columns.
  • PII Detection: Automatically identifies and tags columns as PII or Non-PII for compliance purposes.
  • Flexible Export Options: Download metadata in Excel, Word, or PDF formats for regulatory or business documentation.
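To make the Data Profiling Support feature concrete, here is a minimal sketch of the kind of per-column summary (min, max, average, distinct values) that could be passed to an LLM alongside a column name. It is not ExfDataScribe's actual implementation: the function names `profile_column` and `profile_to_prompt_hint` are hypothetical, and the sketch assumes the column values have already been fetched into a Python list of a single comparable type.

```python
from statistics import mean
from typing import Any, Dict, List, Optional


def profile_column(values: List[Optional[Any]]) -> Dict[str, Any]:
    """Summarize one column. Hypothetical helper, not the product's API.

    Assumes all non-null values share one comparable type (all numbers
    or all strings), so min() and max() are well defined.
    """
    non_null = [v for v in values if v is not None]
    profile: Dict[str, Any] = {
        "count": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(set(non_null)),
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
        "average": None,
    }
    # An average only makes sense for numeric columns (bools are excluded).
    is_numeric = bool(non_null) and all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in non_null
    )
    if is_numeric:
        profile["average"] = mean(non_null)
    return profile


def profile_to_prompt_hint(column_name: str, profile: Dict[str, Any]) -> str:
    """Render the profile as one line of context for a description prompt."""
    return (
        f"{column_name}: min={profile['min']}, max={profile['max']}, "
        f"average={profile['average']}, distinct={profile['distinct']}, "
        f"nulls={profile['nulls']}"
    )


if __name__ == "__main__":
    ages = [10, 20, None, 30]
    print(profile_to_prompt_hint("age", profile_column(ages)))
```

In practice these aggregates would more likely be computed inside the database with SQL aggregate functions rather than in Python; the sketch only illustrates what a profile contains.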
Specifications
Minimum Software Requirements
  • Operating System: Linux (Ubuntu 20.04 or later)
  • Python Version: 3.9+
  • Database: MongoDB and PostgreSQL 12+, or an equivalent database or data warehouse
  • Containerization: Docker 20.10+ / Kubernetes 1.20+
Minimum Hardware Requirements
  • CPU: 4 Cores
  • RAM: 8 GB
  • Storage: 100 GB free disk space
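The minimum requirements above can be checked on a target host before deployment. The following is a rough preflight sketch using only the Python standard library; it is not shipped with ExfDataScribe, and the names `check_requirements` and `measure_host` are hypothetical. The thresholds are copied from the lists above. The RAM measurement relies on `os.sysconf`, which is available on Linux, and the reported physical memory on a nominal 8 GB machine may come out slightly below 8.

```python
import os
import shutil
import sys
from typing import List, Tuple

# Thresholds taken from the requirements listed above.
MIN_PYTHON = (3, 9)
MIN_CPU_CORES = 4
MIN_RAM_GB = 8
MIN_FREE_DISK_GB = 100


def check_requirements(
    python_version: Tuple[int, int],
    cpu_cores: int,
    ram_gb: float,
    free_disk_gb: float,
) -> List[str]:
    """Return a list of unmet requirements (empty if all are met)."""
    problems: List[str] = []
    if tuple(python_version[:2]) < MIN_PYTHON:
        problems.append(f"Python {MIN_PYTHON[0]}.{MIN_PYTHON[1]}+ required")
    if cpu_cores < MIN_CPU_CORES:
        problems.append(f"{MIN_CPU_CORES} CPU cores required, found {cpu_cores}")
    if ram_gb < MIN_RAM_GB:
        problems.append(f"{MIN_RAM_GB} GB RAM required, found {ram_gb:.1f}")
    if free_disk_gb < MIN_FREE_DISK_GB:
        problems.append(
            f"{MIN_FREE_DISK_GB} GB free disk required, found {free_disk_gb:.1f}"
        )
    return problems


def measure_host(path: str = "/") -> Tuple[Tuple[int, int], int, float, float]:
    """Measure the current host (Linux assumed for the RAM query)."""
    cpu_cores = os.cpu_count() or 0
    try:
        ram_bytes = os.sysconf("SC_PHYS_PAGES") * os.sysconf("SC_PAGE_SIZE")
        ram_gb = ram_bytes / 1024**3
    except (AttributeError, ValueError, OSError):
        ram_gb = 0.0  # could not be determined on this platform
    free_disk_gb = shutil.disk_usage(path).free / 1024**3
    return (sys.version_info[0], sys.version_info[1]), cpu_cores, ram_gb, free_disk_gb


if __name__ == "__main__":
    issues = check_requirements(*measure_host())
    print("All minimum requirements met." if not issues else "\n".join(issues))
```

Keeping the comparison in `check_requirements` separate from the measurement makes the thresholds easy to test and to adjust if a later release changes them.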
Resources
  • Docker Compose File: Link
  • Kubernetes YAML File: Link