Alternatives to Talend Data Catalog
Compare Talend Data Catalog alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Talend Data Catalog in 2025. Compare features, ratings, user reviews, pricing, and more from Talend Data Catalog competitors and alternatives in order to make an informed decision for your business.
-
1
DataHub
DataHub
DataHub Cloud is an event-driven AI & Data Context Platform that uses active metadata for real-time visibility across your entire data ecosystem. Unlike traditional data catalogs that provide outdated snapshots, DataHub Cloud instantly propagates changes, automatically enforces policies, and connects every data source across platforms with 100+ pre-built connectors. Built on an open source foundation with a thriving community of 13,000+ members, DataHub gives you unmatched flexibility to customize and extend without vendor lock-in. DataHub Cloud is a modern metadata platform with REST and GraphQL APIs that optimize performance for complex queries, essential for AI-ready data management and ML lifecycle support. -
2
dbt
dbt Labs
dbt helps data teams transform raw data into trusted, analysis-ready datasets faster. With dbt, data analysts and data engineers can collaborate on version-controlled SQL models, enforce testing and documentation standards, lean on detailed metadata to troubleshoot and optimize pipelines, and deploy transformations reliably at scale. Built on modern software engineering best practices, dbt brings transparency and governance to every step of the data transformation workflow. Thousands of companies, from startups to Fortune 500 enterprises, rely on dbt to improve data quality and trust as well as drive efficiencies and reduce costs as they deliver AI-ready data across their organization. Whether you’re scaling data operations or just getting started, dbt empowers your team to move from raw data to actionable analytics with confidence. -
3
AnalyticsCreator
AnalyticsCreator
AnalyticsCreator is a metadata-driven data warehouse automation solution built specifically for teams working within the Microsoft data ecosystem. It helps organizations speed up the delivery of production-ready data products by automating the entire data engineering lifecycle—from ELT pipeline generation and dimensional modeling to historization and semantic model creation for platforms like Microsoft SQL Server, Azure Synapse Analytics, and Microsoft Fabric. By eliminating repetitive manual coding and reducing the need for multiple disconnected tools, AnalyticsCreator helps data teams reduce tool sprawl and enforce consistent modeling standards across projects. The solution includes built-in support for automated documentation, lineage tracking, schema evolution, and CI/CD integration with Azure DevOps and GitHub. Whether you’re working on data marts, data products, or full-scale enterprise data warehouses, AnalyticsCreator allows you to build faster, govern better, and deliver -
4
OneTrust Privacy Automation
OneTrust
Go beyond compliance and build trust through transparency, choice, and control. People demand greater control of their data, unlocking an opportunity for organizations to use these moments to build trust and deliver more valuable experiences. We provide privacy and data governance automation to help organizations better understand their data across the business, meet regulatory requirements, and operationalize risk mitigation to provide transparency and choice to individuals. Achieve data privacy compliance faster and build trust in your organization. Our platform helps break down silos across processes, workflows, and teams to operationalize regulatory compliance and enable trusted data use. Build proactive privacy programs rooted in global best practices, not reactive to individual regulations. Gain visibility into unknown risks to drive mitigation and risk-based decision making. Respect individual choice and embed privacy and security by default into the data lifecycle. -
5
MANTA
Manta
Manta is the world-class automated approach to visualize, optimize, and modernize how data moves through your organization through code-level lineage. By automatically scanning your data environment with the power of 50+ out-of-the-box scanners, Manta builds a powerful map of all data pipelines to drive efficiency and productivity. Visit manta.io to learn more. With Manta platform, you can make your data a truly enterprise-wide asset, bridge the understanding gap, enable self-service, and easily: • Increase productivity • Accelerate development • Shorten time-to-market • Reduce costs and manual effort • Run instant and accurate root cause and impact analyses • Scope and perform effective cloud migrations • Improve data governance and regulatory compliance (GDPR, CCPA, HIPAA, and more) • Increase data quality • Enhance data privacy and data security -
6
IRI Voracity
IRI, The CoSort Company
Voracity is the only high-performance, all-in-one data management platform accelerating AND consolidating the key activities of data discovery, integration, migration, governance, and analytics. Voracity helps you control your data in every stage of the lifecycle, and extract maximum value from it. Only in Voracity can you: 1) CLASSIFY, profile and diagram enterprise data sources 2) Speed or LEAVE legacy sort and ETL tools 3) MIGRATE data to modernize and WRANGLE data to analyze 4) FIND PII everywhere and consistently MASK it for referential integrity 5) Score re-ID risk and ANONYMIZE quasi-identifiers 6) Create and manage DB subsets or intelligently synthesize TEST data 7) Package, protect and provision BIG data 8) Validate, scrub, enrich and unify data to improve its QUALITY 9) Manage metadata and MASTER data. Use Voracity to comply with data privacy laws, de-muck and govern the data lake, improve the reliability of your analytics, and create safe, smart test data -
7
Alation
Alation
The Alation Agentic Data Intelligence Platform enables organizations to scale and accelerate their AI and data initiatives. By unifying search, cataloging, governance, lineage, and analytics, it transforms metadata into a strategic asset for decision-making. The platform’s AI-powered agents—including Documentation, Data Quality, and Data Products Builder—automate complex data management tasks. With active metadata, workflow automation, and more than 120 pre-built connectors, Alation integrates seamlessly into modern enterprise environments. It helps organizations build trusted AI models by ensuring data quality, transparency, and compliance across the business. Trusted by 40% of the Fortune 100, Alation empowers teams to make faster, more confident decisions with trusted data. -
8
Apache Airflow
The Apache Software Foundation
Airflow is a platform created by the community to programmatically author, schedule and monitor workflows. Airflow has a modular architecture and uses a message queue to orchestrate an arbitrary number of workers. Airflow is ready to scale to infinity. Airflow pipelines are defined in Python, allowing for dynamic pipeline generation. This allows for writing code that instantiates pipelines dynamically. Easily define your own operators and extend libraries to fit the level of abstraction that suits your environment. Airflow pipelines are lean and explicit. Parametrization is built into its core using the powerful Jinja templating engine. No more command-line or XML black-magic! Use standard Python features to create your workflows, including date time formats for scheduling and loops to dynamically generate tasks. This allows you to maintain full flexibility when building your workflows. -
9
OvalEdge
OvalEdge
OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organizations’ databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.Starting Price: $1,300/month -
10
Lyftrondata
Lyftrondata
Whether you want to build a governed delta lake, data warehouse, or simply want to migrate from your traditional database to a modern cloud data warehouse, do it all with Lyftrondata. Simply create and manage all of your data workloads on one platform by automatically building your pipeline and warehouse. Analyze it instantly with ANSI SQL, BI/ML tools, and share it without worrying about writing any custom code. Boost the productivity of your data professionals and shorten your time to value. Define, categorize, and find all data sets in one place. Share these data sets with other experts with zero codings and drive data-driven insights. This data sharing ability is perfect for companies that want to store their data once, share it with other experts, and use it multiple times, now and in the future. Define dataset, apply SQL transformations or simply migrate your SQL data processing logic to any cloud data warehouse. -
11
Service Objects Name Validation
Service Objects
Having the correct name is essential to effectively communicating with a customer or lead. Name Validation performs a 40-step check to help your business weed out bogus and inaccurate names and prevent embarrassing personalization mistakes from being sent to customers and prospects. Your brand has a lot riding on getting your customers' and prospects' names right. Accurate names are key to effective personalization and also an important indicator of fraudulent and bogus web form submissions. Name Validation verifies first and last names using a global database of more than 1.4 million first names and 2.75 million last names, correcting common mistakes and flagging garbage before it enters your database. Our real-time name validation and verification service corrects and then tests against a proprietary database containing millions of consumer names to determine an overall quality score. Your business can use this score to block or deny bogus submissions from entering your sales.Starting Price: $299/month -
12
Trifacta
Trifacta
The fastest way to prep data and build data pipelines in the cloud. Trifacta provides visual and intelligent guidance to accelerate data preparation so you can get to insights faster. Poor data quality can sink any analytics project. Trifacta helps you understand your data so you can quickly and accurately clean it up. All the power with none of the code. Trifacta provides visual and intelligent guidance so you can get to insights faster. Manual, repetitive data preparation processes don’t scale. Trifacta helps you build, deploy and manage self-service data pipelines in minutes not months. -
13
WhereScape
WhereScape Software
WhereScape helps IT organizations of all sizes leverage automation to design, develop, deploy, and operate data infrastructure faster. More than 700 customers worldwide rely on WhereScape automation to eliminate hand-coding and other repetitive, time-intensive aspects of data infrastructure projects to deliver data warehouses, vaults, lakes and marts in days or weeks rather than in months or years. From data warehouses and vaults to data lakes and marts, deliver data infrastructure and big data integration fast. Quickly and easily plan, model and design all types of data infrastructure projects. Use sophisticated data discovery and profiling capabilities to bulletproof design and rapid prototyping to collaborate earlier with business users. Fast-track the development, deployment and operation of your data infrastructure projects. Dramatically reduce the delivery time, effort, cost and risk of new projects, and better position projects for future business change. -
14
Stitch
Qlik
Stitch is a cloud-based platform for ETL – extract, transform, and load. More than a thousand companies use Stitch to move billions of records every day from SaaS applications and databases into data warehouses and data lakes. -
15
Atlan
Atlan
The modern data workspace. Make all your data assets from data tables to BI reports, instantly discoverable. Our powerful search algorithms combined with easy browsing experience, make finding the right asset, a breeze. Atlan auto-generates data quality profiles which make detecting bad data, dead easy. From automatic variable type detection & frequency distribution to missing values and outlier detection, we’ve got you covered. Atlan takes the pain away from governing and managing your data ecosystem! Atlan’s bots parse through SQL query history to auto construct data lineage and auto-detect PII data, allowing you to create dynamic access policies & best in class governance. Even non-technical users can directly query across multiple data lakes, warehouses & DBs using our excel-like query builder. Native integrations with tools like Tableau and Jupyter makes data collaboration come alive. -
16
Octopai
Octopai
Harness the power of data lineage, discovery and a data catalog to achieve full control of your data. that can instantly navigate through the most complex data landscapes. Gain access to the most comprehensive automated data lineage, discovery and data catalog. Providing unprecedented visibility and trust into the most complex data environments. Octopai extracts metadata from your entire data environment. With a quick, secure and simple process, Octopai will instantly be able to analyze the metadata. In one centralized platform Octopai allows you to access data lineage, data discovery and a data catalog, automatically. Trace any data end-to-end through your entire data landscape, in seconds. Automatically find the data you need anywhere in your data landscape. Create company-wide consistency with a self-creating, self-updating data catalog. -
17
Datameer
Datameer
Datameer revolutionizes data transformation with a low-code approach, trusted by top global enterprises. Craft, transform, and publish data seamlessly with no code and SQL, simplifying complex data engineering tasks. Empower your data teams to make informed decisions confidently while saving costs and ensuring responsible self-service analytics. Speed up your analytics workflow by transforming datasets to answer ad-hoc questions and support operational dashboards. Empower everyone on your team with our SQL or Drag-and-Drop to transform your data in an intuitive and collaborative workspace. And best of all, everything happens in Snowflake. Datameer is designed and optimized for Snowflake to reduce data movement and increase platform adoption. Some of the problems Datameer solves: - Analytics is not accessible - Drowning in backlog - Long development -
18
Collibra
Collibra
With a best-in-class catalog, flexible governance, continuous quality, and built-in privacy, the Collibra Data Intelligence Cloud is your single system of engagement for data. Support your users with a best-in-class data catalog that includes embedded governance, privacy and quality. Raise the grade, by ensuring teams can quickly find, understand and access data across sources, business applications, BI and data science tools in one central location. Give your data some much-needed privacy. Centralize, automate and guide workflows to encourage collaboration, operationalize privacy and address global regulatory requirements. Get the full story around your data with Collibra Data Lineage. Automatically map relationships between systems, applications and reports to provide a context-rich view across the enterprise. Hone in on the data you care about most and trust that it is relevant, complete and trustworthy. -
19
Dataplex Universal Catalog
Google
Dataplex Universal Catalog is Google Cloud’s intelligent governance platform for data and AI artifacts. It centralizes discovery, management, and monitoring across data lakes, warehouses, and databases, giving teams unified access to trusted data. With Vertex AI integration, users can instantly find datasets, models, features, and related assets in one search experience. It supports semantic search, data lineage, quality checks, and profiling to improve trust and compliance. Integrated with BigQuery and BigLake, it enables end-to-end governance for both proprietary and open lakehouse environments. Dataplex Universal Catalog helps organizations democratize data access, enforce governance, and accelerate analytics and AI initiatives.Starting Price: $0.060 per hour -
20
Decube
Decube
Decube is a data management platform that helps organizations manage their data observability, data catalog, and data governance needs. It provides end-to-end visibility into data and ensures its accuracy, consistency, and trustworthiness. Decube's platform includes data observability, a data catalog, and data governance components that work together to provide a comprehensive solution. The data observability tools enable real-time monitoring and detection of data incidents, while the data catalog provides a centralized repository for data assets, making it easier to manage and govern data usage and access. The data governance tools provide robust access controls, audit reports, and data lineage tracking to demonstrate compliance with regulatory requirements. Decube's platform is customizable and scalable, making it easy for organizations to tailor it to meet their specific data management needs and manage data across different systems, data sources, and departments. -
21
Oracle Cloud Infrastructure (OCI) Data Catalog is a metadata management service that helps data professionals discover data and support data governance. Designed specifically to work well with the Oracle ecosystem, it provides an inventory of assets, a business glossary, and a common metastore for data lakes. OCI Data Catalog is fully managed by Oracle and runs with all the power and scale of Oracle Cloud Infrastructure. Benefit from all of the security, reliability, performance, and scale of Oracle Cloud while using OCI Data Catalog. Using REST APIs and SDKs, developers can integrate OCI Data Catalog’s capabilities in their custom applications. Using a trusted system for managing user identities and access privileges, administrators can control access to data catalog objects and capabilities to manage security requirements. Discover data assets across Oracle data stores on-premises and in the cloud to start gaining real value from data.
-
22
Apache Atlas
Apache Software Foundation
Atlas is a scalable and extensible set of core foundational governance services – enabling enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the whole enterprise data ecosystem. Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets and provide collaboration capabilities around these data assets for data scientists, analysts and the data governance team. Pre-defined types for various Hadoop and non-Hadoop metadata. Ability to define new types for the metadata to be managed. Types can have primitive attributes, complex attributes, object references; can inherit from other types. Instances of types, called entities, capture metadata object details and their relationships. REST APIs to work with types and instances allow easier integration. -
23
erwin Data Intelligence
Quest Software
erwin Data Intelligence (erwin DI) combines data catalog and data literacy capabilities for greater awareness of and access to available data assets, guidance on their use, and guardrails to ensure data policies and best practices are followed. Automatically harvest, transform and feed metadata from a wide array of data sources, operational processes, business applications and data models into a central catalog. Then make it accessible and understandable via role-based, contextual views so stakeholders can make strategic decisions based on accurate insights. erwin DI supports enterprise data governance, digital transformation and any effort that relies on data for favorable outcomes. Schedule ongoing scans of metadata from the widest array of data sources. Easily map data elements from source to target, including data in motion, and harmonize data integration across platforms. Enable data consumers to define and discover data relevant to their roles.Starting Price: $299 per month -
24
Data360 Govern
Precisely
Your organization knows the value of data and the need to get it into the hands of business users for maximum impact, but without enterprise data governance, that data might be hard to find, understand, and trust. Data360 Govern is an enterprise data governance, catalog, and metadata management solution that gives you confidence in the quality, value, and trustworthiness of your data. It automates governance and stewardship tasks to help you answer essential questions about your data’s source, use, meaning, ownership, and quality. With Data360 Govern, you can make faster decisions on data usage and management, build collaboration across your entire organization, and allow users to get the answers they need – when they need them. Transparency into your organization’s data landscape gives you the power to track the critical data aligned with your most important business outcomes. -
25
Catalog
Coalesce
Catalog from Coalesce (formerly CastorDoc) is a data catalog designed for mass adoption across the whole company. Have an overview of all your data environment. Search for data instantly thanks to our powerful search engine. Onboard to a new data infrastructure and access data in a breeze. Go beyond your traditional data catalog. Modern data teams now have numerous data sources, build one truth. With its delightful and automated documentation experience, Catalog makes it dead simple to trust data. Column-level, cross-system data lineage in minutes. Get a bird’s eye view of your data pipelines to build trust in your data. Troubleshoot data issues, perform impact analyses, comply with GDPR in one tool. Optimize performance, cost, compliance, and security for your data. Keep your data stack healthy with our automated infrastructure monitoring system.Starting Price: $699 per month -
26
Select Star
Select Star
Set up your automated data catalog in just 15 minutes, and receive column-level lineage, Entity Relationship (ER) diagram, and auto-populated documentation within 24 hours. Easily find, tag, and add documentation to your data so everyone can find the right dataset for their use case. Select Star automatically detects and displays your column-level data lineage. You can now trust the data, knowing where it came from. Select Star automatically surfaces how your company uses data. That means you can identify relevant data fields without needing to ask someone else. Select Star treats your data with AICPA SOC 2 Security, Confidentiality, and Availability standards, making sure your data is always safe and sound.Starting Price: $270 per month -
27
Global IDs
Global IDs
Find out some of the best platform features provided by Global IDs, which brings in a set of Enterprise Data Solutions like data governance, data compliance, cloud migration, rationalization, privacy, analytics & much more! Global IDs EDA Platform feature comprises a set of core functions: automated discovery and profiling, data classification, data lineage, data quality and more – that render data transparent, trustworthy, and explainable across the ecosystem. Global IDs EDA platform architecture is designed for integration from the ground up with all platform functionality accessible via APIs. Global IDs EDA platform automates data management for enterprises of any size or data ecosystem. -
28
Databricks Data Intelligence Platform
Databricks
The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker. -
29
Dawiso
Dawiso
Dawiso is your modern platform for managing and understanding data, built to unify governance and usability in a way that works for your entire organization. At its core is a powerful, AI-powered data catalog, enabling teams to quickly discover, interpret, and access trusted data across systems, reports, and business tools. With flexible governance features and business-friendly documentation apps, Dawiso bridges the gap between technical and non-technical users, fostering true collaboration. Enhance trust in your data with clear, visual data lineage that maps relationships across sources and systems, giving you full context and control. Support compliance through customizable workflows, role-based access, and structured metadata capture.Starting Price: $49 per user per month -
30
DvSum
DvSum
DvSum is a AI-powered Data Intelligence platform that makes it remarkably easier for your data and analytics teams to discover, monitor, and govern data. With powerful AI-enabled algorithms, DvSum automatically catalogues, classifies, and curates your data and makes it available as an actionable Data Catalog. Propel your enterprise towards its digital and analytics enabled transformation goals with DvSum Data Intelligence.Starting Price: $1000/ per month -
31
DataGalaxy
DataGalaxy
DataGalaxy’s all-in-one data catalog offers out-of-the-box actionability with fully-customizable attributes, visualization tools, and AI integration to give business teams the ability to document, link, and track all their metadata assets. The Data Catalog 360°’s user-centric platform is dedicated to metadata mapping, management, and knowledge sharing to help your organization manage data your way. A data catalog enables employees from all teams to collaborate using centralized, homogeneous data sets. Our data catalog provides clarity on data definitions, synonyms, and essential business attributes with a semantic layer so all users can understand and leverage their data as an asset. When you need answers about specific metadata, turn to the data catalog that identifies a topic’s 360° data experts, owners, and stewards empowering your team through streamlined collaboration. -
32
Dataedo
Dataedo
Discover, document and manage your metadata. Dataedo is equipped with multiple automated metadata scanners that connect to various database technologies, extract data structures and metadata, and load them into the metadata repository. With a few clicks, build a catalog of your data and describe each element. Decrypt table and column names with business-friendly aliases, provide meaning and purpose of data assets with descriptions and user-defined custom fields. Use sample data to learn what data is stored in your data assets. Understand the data better before using it and make sure that the data is good quality. Ensure high data quality with data profiling. Democratize access to knowledge about data. Build data literacy, democratize data and empower everyone in your organization to make better use of your data with a lightweight on-premises data catalog. Boost data literacy through a data catalog.Starting Price: $49 per month -
33
Ataccama ONE
Ataccama
Ataccama reinvents the way data is managed to create value on an enterprise scale. Unifying Data Governance, Data Quality, and Master Data Management into a single, AI-powered fabric across hybrid and Cloud environments, Ataccama gives your business and data teams the ability to innovate with unprecedented speed while maintaining trust, security, and governance of your data. -
34
Aggua
Aggua
Aggua is a data fabric augmented AI platform that enables data and business teams Access to their data, creating Trust and giving practical Data Insights, for a more holistic, data-centric decision-making. Instead of wondering what is going on underneath the hood of your organization's data stack, become immediately informed with a few clicks. Get access to data cost insights, data lineage and documentation without needing to take time out of your data engineer's workday. Instead of spending a lot of time tracing what a data type change will break in your data pipelines, tables and infrastructure, with automated lineage, your data architects and engineers can spend less time manually going through logs and DAGs and more time actually making the changes to infrastructure. -
35
Tree Schema Data Catalog
Tree Schema
The essential tool for metadata management. Automatically populate your entire catalog in under 5 minutes! Data Discovery. Find the data you need anywhere within your data ecosystem from the database all the way down to the specific values for each field. Automatically document your data from existing data stores. First-class support for tabular and unstructured data. Automated data governance actions. Data Lineage. Explore your data lineage and understand where your data comes from and where it is going. View impact analysis of changes Find all up and downstream impacts. Visualize relationships and connections. API AccessNew. Manage your data lineage as code and keep your catalog up to date with the Tree Schema API. Integrate Data Lineage into CICD pipelines Capture values & descriptions within your code Analyze impact for breaking changes. Data Dictionary. Know the key terms and lingo that drive your business. Define the context and scope for keywordsStarting Price: $99 per month -
36
Blindata
Blindata
Blindata covers all the functions of a Data Governance program: Business Glossary, Data Catalog & Data Lineage build an integrated and complete view on your Data. Data Classification module gives a semantic meaning to the data while the Data Quality, Issue Management & Data Stewardship modules improve the reliability and trust on data. Moreover, privacy compliance can leverage specific features: registry of processing activities, centralized privacy note management, consent registry with Blockchain integrated notarization. Blindata Agent can connect to different data sources, collecting metadata such data structures (Tables, Views, Fields, …), data quality metrics, reverse lineage, etc. Blindata has a modular and entirely API based architecture allowing systematic integration with the most critical business systems (DBMS, Active Directory, e-commerce, Data Platforms). Blindata is available as SaaS, can be installed “on Premise” or purchased on AWS Marketplace.Starting Price: $1000/year/user -
37
Secuvy AI
Secuvy
Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via AI-driven workflows. Best in class data intelligence especially for unstructured data. Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via ai-driven workflows. Best in class data intelligence especially for unstructured data. Automated data discovery, customizable subject access requests, user validations, data maps & workflows for privacy regulations such as ccpa, gdpr, lgpd, pipeda and other global privacy laws. Data intelligence to find sensitive and privacy information across multiple data stores at rest and in motion. In a world where data is growing exponentially, our mission is to help organizations to protect their brand, automate processes, and improve trust with customers. With ever-expanding data sprawls we wish to reduce human efforts, costs & errors for handling Sensitive Data. -
38
Informatica Enterprise Data Catalog
Informatica
Scan and index metadata, discover and profile data, and provide detailed lineage across tens of millions of data sets. Classify and organize data assets across any environment to maximize data value and reuse. Automatically scan across multi-cloud platforms, BI tools, ETL, and third-party metadata catalogs; and data types. Leverage AI-powered domain discovery, data similarity, business term associations, and recommendations. Track data movement, from high-level system views to granular column-level lineage, and get detailed impact analysis. Use the Data Asset Analytics dashboard to understand asset usage, enrichment, and collaboration. View data quality rules, scorecards, metric groups, and profiling stats in context. Tap into shared data knowledge with certifications, ratings and reviews, a Q&A platform, and change notifications. Our broad and deep lineup of enterprise-grade data management solutions sets Informatica apart from the crowd. -
39
Validio
Validio
See how your data assets are used: popularity, utilization, and schema coverage. Get important insights about your data assets such as popularity, utilization, quality, and schema coverage. Find and filter the data you need based on metadata tags and descriptions. Get important insights about your data assets such as popularity, utilization, quality, and schema coverage. Drive data governance and ownership across your organization. Stream-lake-warehouse lineage to facilitate data ownership and collaboration. Automatically generated field-level lineage map to understand the entire data ecosystem. Anomaly detection learns from your data and seasonality patterns, with automatic backfill from historical data. Machine learning-based thresholds are trained per data segment, trained on actual data instead of metadata only. -
40
Acryl Data
Acryl Data
No more data catalog ghost towns. Acryl Cloud drives fast time-to-value via Shift Left practices for data producers and an intuitive UI for data consumers. Continuously detect data quality incidents in real-time, automate anomaly detection to prevent breakages, and drive fast resolution when they do occur. Acryl Cloud supports both push-based and pull-based metadata ingestion for easy maintenance, ensuring information is trustworthy, up-to-date, and definitive. Data should be operational. Go beyond simple visibility and use automated Metadata Tests to continuously expose data insights and surface new areas for improvement. Reduce confusion and accelerate resolution with clear asset ownership, automatic detection, streamlined alerts, and time-based lineage for tracing root causes. -
41
Azure Data Catalog
Microsoft
In the new world of data, you can spend more time looking for data than you do analyzing it. Azure Data Catalog is an enterprise-wide metadata catalog that makes data asset discovery straightforward. It’s a fully-managed service that lets you—from analyst to data scientist to data developer—register, enrich, discover, understand, and consume data sources. Work with data in the tool of your choice. Data Catalog lets you find the data you need and use it in the tools you choose. Your data stays where you want it, and Data Catalog helps you discover and work with it where you want, with an intuitive user experience. ncrease broad adoption and continuous value creation across your data ecosystem. Data Catalog helps you get tips, tricks, and unwritten rules into an experience where everyone can get value. With Data Catalog, everyone can contribute. Democratize data asset discovery.Starting Price: $1 per user per month -
42
Zaloni Arena
Zaloni
End-to-end DataOps built on an agile platform that improves and safeguards your data assets. Arena is the premier augmented data management platform. Our active data catalog enables self-service data enrichment and consumption to quickly control complex data environments. Customizable workflows that increase the accuracy and reliability of every data set. Use machine-learning to identify and align master data assets for better data decisioning. Complete lineage with detailed visualizations alongside masking and tokenization for superior security. We make data management easy. Arena catalogs your data, wherever it is and our extensible connections enable analytics to happen across your preferred tools. Conquer data sprawl challenges: Our software drives business and analytics success while providing the controls and extensibility needed across today’s decentralized, multi-cloud data complexity. -
43
Collate
Collate
Collate is an AI‑driven metadata platform that empowers data teams with automated discovery, observability, quality, and governance through agent‑based workflows. Built on the open source OpenMetadata foundation and a unified metadata graph, it offers 90+ turnkey connectors to ingest metadata from databases, data warehouses, BI tools, and pipelines, delivering in‑depth column‑level lineage, data profiling, and no‑code quality tests. Its AI agents automate data discovery, permission‑aware querying, alerting, and incident‑management workflows at scale, while real‑time dashboards, interactive analyses, and a collaborative business glossary enable both technical and non‑technical users to steward high‑quality data assets. Continuous monitoring and governance automations enforce compliance with standards such as GDPR and CCPA, reducing mean time to resolution for data issues and lowering total cost of ownership.Starting Price: Free -
44
Alex Solutions
Alex Solutions
The Alex Platform is your enterprise’s single source of data and business truth. Alex is a foundational pillar of our customer’s data-driven success. From day one of implementation, Alex is designed to start reducing complexity and creating value immediately. Alex Augmented Data Catalog is powered by the industry’s best machine learning, rapidly providing a unified, enterprise-wide data platform. No matter how complex your technical landscape may be, Alex Data Lineage helps you map and understand your data flows in an automated and secure way. Worldwide teams need worldwide coordination. Alex Intelligent Business Glossary’s beautiful UI and rich functionality is perfect for conducting global collaboration. Combat complexity of the multi-cloud and global enterprise by unifying all definitions, policies, metrics, rules, processes, workflows and more. Power global data governance programs. -
45
Cloudera
Cloudera
Manage and secure the data lifecycle from the Edge to AI in any cloud or data center. Operates across all major public clouds and the private cloud with a public cloud experience everywhere. Integrates data management and analytic experiences across the data lifecycle for data anywhere. Delivers security, compliance, migration, and metadata management across all environments. Open source, open integrations, extensible, & open to multiple data stores and compute architectures. Deliver easier, faster, and safer self-service analytics experiences. Provide self-service access to integrated, multi-function analytics on centrally managed and secured business data while deploying a consistent experience anywhere—on premises or in hybrid and multi-cloud. Enjoy consistent data security, governance, lineage, and control, while deploying the powerful, easy-to-use cloud analytics experiences business users require and eliminating their need for shadow IT solutions. -
46
Anzo
Cambridge Semantics
Anzo is a modern data discovery and integration platform that lets anyone find, connect and blend any enterprise data into analytics-ready datasets. Anzo’s unique use of semantics and graph data models makes it practical for the first time for virtually anyone in your organization – from skilled data scientists to novice business users – to drive the data discovery and integration process and build their own analytics-ready datasets. Anzo’s graph data models provide business users with a visual map of enterprise data that is easy to understand and navigate, even when your data is vast, siloed and complex. Semantics add business content to data, allowing users to harmonize data based on shared definitions and build blended, business-ready data on demand. -
47
1touch.io Inventa
1touch.io
Partial visibility into your data is putting your organization at risk. 1touch.io uses a unique network analytics approach, powerful ML and AI techniques together with unprecedented data lineage accuracy to continuously discover and catalog all your sensitive and protected data into a PII Inventory and a Master Data Catalog. We automatically discover and analyze all usage of data and its lineage without relying upon the organization’s knowledge of the existence or location of the data. Adding a multilayer machine learning analytic engine gives us the ability to “read and understand” the data and link all the pieces into a full picture represented as both a PII inventory and a Master Catalog. Finding your known and unknown sensitive data within your network allows for immediate risk reduction. Organizing your data flow to understand precise data lineage and business processes enables you to achieve core compliance requirements. -
48
Kylo
Teradata
Kylo is an open source enterprise-ready data lake management software platform for self-service data ingest and data preparation with integrated metadata management, governance, security and best practices inspired by Think Big's 150+ big data implementation projects. Self-service data ingest with data cleansing, validation, and automatic profiling. Wrangle data with visual sql and an interactive transform through a simple user interface. Search and explore data and metadata, view lineage, and profile statistics. Monitor health of feeds and services in the data lake. Track SLAs and troubleshoot performance. Design batch or streaming pipeline templates in Apache NiFi and register with Kylo to enable user self-service. Organizations can expend significant engineering effort moving data into Hadoop yet struggle to maintain governance and data quality. Kylo dramatically simplifies data ingest by shifting ingest to data owners through a simple guided UI. -
49
SAP Information Steward software supports data profiling and monitoring and information policy management. As the information governance layer of SAP Business Technology Platform, it can help you anticipate risk and drive better business outcomes. Combine data profiling, data lineage, and metadata management to gain continuous insight into the integrity of your enterprise data model. Gain a better understanding of data quality across your data management landscape,while accessing and analyzing metrics with intuitive dashboards and scorecards. Improve enterprise information management initiatives by supporting analysts, data stewards, and IT experts with consistent validation rules and guidelines. Discover, assess, define, monitor, and improve the quality of your enterprise data assets with data profiling and metadata management – all with one solution. Forecast the potential savings of improved data quality by running what-if analyses.
-
50
Activate business-ready data for AI and analytics with intelligent cataloging, backed by active metadata and policy management. IBM Watson® Knowledge Catalog is a data catalog tool that powers intelligent, self-service discovery of data, models and more. The cloud-based enterprise metadata repository activates information for AI, machine learning (ML) and deep learning. Access, curate, categorize and share data, knowledge assets and their relationships, wherever they reside. Organize, define and manage enterprise data to provide the right context and drive value across needs like regulatory compliance and data monetization. Protect data, manage compliance and audit-readiness, and maintain client trust with active policy management and dynamic masking of sensitive data. Consume and transform data at the speed of business with intuitive dashboards and flows that can be shared with peers or analytics tools.Starting Price: $300 per instance