Alternatives to Aggua

Compare Aggua alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Aggua in 2026. Compare features, ratings, user reviews, pricing, and more from Aggua competitors and alternatives in order to make an informed decision for your business.

  • 1
    DataHub

    DataHub

    DataHub

    DataHub Cloud is an event-driven AI & Data Context Platform that uses active metadata for real-time visibility across your entire data ecosystem. Unlike traditional data catalogs that provide outdated snapshots, DataHub Cloud instantly propagates changes, automatically enforces policies, and connects every data source across platforms with 100+ pre-built connectors. Built on an open source foundation with a thriving community of 13,000+ members, DataHub gives you unmatched flexibility to customize and extend without vendor lock-in. DataHub Cloud is a modern metadata platform with REST and GraphQL APIs that optimize performance for complex queries, essential for AI-ready data management and ML lifecycle support.
    Compare vs. Aggua View Software
    Visit Website
  • 2
    dbt

    dbt

    dbt Labs

    dbt helps data teams transform raw data into trusted, analysis-ready datasets faster. With dbt, data analysts and data engineers can collaborate on version-controlled SQL models, enforce testing and documentation standards, lean on detailed metadata to troubleshoot and optimize pipelines, and deploy transformations reliably at scale. Built on modern software engineering best practices, dbt brings transparency and governance to every step of the data transformation workflow. Thousands of companies, from startups to Fortune 500 enterprises, rely on dbt to improve data quality and trust as well as drive efficiencies and reduce costs as they deliver AI-ready data across their organization. Whether you’re scaling data operations or just getting started, dbt empowers your team to move from raw data to actionable analytics with confidence.
    Compare vs. Aggua View Software
    Visit Website
  • 3
    AnalyticsCreator

    AnalyticsCreator

    AnalyticsCreator

    AnalyticsCreator is a metadata-driven data warehouse automation solution built specifically for teams working within the Microsoft data ecosystem. It helps organizations speed up the delivery of production-ready data products by automating the entire data engineering lifecycle—from ELT pipeline generation and dimensional modeling to historization and semantic model creation for platforms like Microsoft SQL Server, Azure Synapse Analytics, and Microsoft Fabric. By eliminating repetitive manual coding and reducing the need for multiple disconnected tools, AnalyticsCreator helps data teams reduce tool sprawl and enforce consistent modeling standards across projects. The solution includes built-in support for automated documentation, lineage tracking, schema evolution, and CI/CD integration with Azure DevOps and GitHub. Whether you’re working on data marts, data products, or full-scale enterprise data warehouses, AnalyticsCreator allows you to build faster, govern better, and deliver
    Compare vs. Aggua View Software
    Visit Website
  • 4
    Pentaho

    Pentaho

    Hitachi Vantara

    With an integrated product suite providing data integration, analytics, cataloging, optimization and quality, Pentaho+ enables seamless data management, driving innovation and informed decision-making. Pentaho+ has helped customers achieve a 3x increase in improved data trust, a 7x increase in impactful business results and most importantly, a 70% increase in productivity.
  • 5
    OneTrust Privacy Automation
    Go beyond compliance and build trust through transparency, choice, and control. People demand greater control of their data, unlocking an opportunity for organizations to use these moments to build trust and deliver more valuable experiences. We provide privacy and data governance automation to help organizations better understand their data across the business, meet regulatory requirements, and operationalize risk mitigation to provide transparency and choice to individuals. Achieve data privacy compliance faster and build trust in your organization. Our platform helps break down silos across processes, workflows, and teams to operationalize regulatory compliance and enable trusted data use. Build proactive privacy programs rooted in global best practices, not reactive to individual regulations. Gain visibility into unknown risks to drive mitigation and risk-based decision making. Respect individual choice and embed privacy and security by default into the data lifecycle.
  • 6
    MANTA

    MANTA

    Manta

    Manta is the world-class automated approach to visualize, optimize, and modernize how data moves through your organization through code-level lineage. By automatically scanning your data environment with the power of 50+ out-of-the-box scanners, Manta builds a powerful map of all data pipelines to drive efficiency and productivity. Visit manta.io to learn more. With Manta platform, you can make your data a truly enterprise-wide asset, bridge the understanding gap, enable self-service, and easily: • Increase productivity • Accelerate development • Shorten time-to-market • Reduce costs and manual effort • Run instant and accurate root cause and impact analyses • Scope and perform effective cloud migrations • Improve data governance and regulatory compliance (GDPR, CCPA, HIPAA, and more) • Increase data quality • Enhance data privacy and data security
  • 7
    Alation

    Alation

    Alation

    The Alation Agentic Data Intelligence Platform enables organizations to scale and accelerate their AI and data initiatives. By unifying search, cataloging, governance, lineage, and analytics, it transforms metadata into a strategic asset for decision-making. The platform’s AI-powered agents—including Documentation, Data Quality, and Data Products Builder—automate complex data management tasks. With active metadata, workflow automation, and more than 120 pre-built connectors, Alation integrates seamlessly into modern enterprise environments. It helps organizations build trusted AI models by ensuring data quality, transparency, and compliance across the business. Trusted by 40% of the Fortune 100, Alation empowers teams to make faster, more confident decisions with trusted data.
  • 8
    Decube

    Decube

    Decube

    Decube is a data management platform that helps organizations manage their data observability, data catalog, and data governance needs. It provides end-to-end visibility into data and ensures its accuracy, consistency, and trustworthiness. Decube's platform includes data observability, a data catalog, and data governance components that work together to provide a comprehensive solution. The data observability tools enable real-time monitoring and detection of data incidents, while the data catalog provides a centralized repository for data assets, making it easier to manage and govern data usage and access. The data governance tools provide robust access controls, audit reports, and data lineage tracking to demonstrate compliance with regulatory requirements. Decube's platform is customizable and scalable, making it easy for organizations to tailor it to meet their specific data management needs and manage data across different systems, data sources, and departments.
  • 9
    Atlan

    Atlan

    Atlan

    The modern data workspace. Make all your data assets from data tables to BI reports, instantly discoverable. Our powerful search algorithms combined with easy browsing experience, make finding the right asset, a breeze. Atlan auto-generates data quality profiles which make detecting bad data, dead easy. From automatic variable type detection & frequency distribution to missing values and outlier detection, we’ve got you covered. Atlan takes the pain away from governing and managing your data ecosystem! Atlan’s bots parse through SQL query history to auto construct data lineage and auto-detect PII data, allowing you to create dynamic access policies & best in class governance. Even non-technical users can directly query across multiple data lakes, warehouses & DBs using our excel-like query builder. Native integrations with tools like Tableau and Jupyter makes data collaboration come alive.
  • 10
    Databricks Data Intelligence Platform
    The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker.
  • 11
    K2View

    K2View

    K2View

    At K2View, we believe that every enterprise should be able to leverage its data to become as disruptive and agile as the best companies in its industry. We make this possible through our patented Data Product Platform, which creates and manages a complete and compliant dataset for every business entity – on demand, and in real time. The dataset is always in sync with its underlying sources, adapts to changes in the source structures, and is instantly accessible to any authorized data consumer. Data Product Platform fuels many operational use cases, including customer 360, data masking and tokenization, test data management, data migration, legacy application modernization, data pipelining and more – to deliver business outcomes in less than half the time, and at half the cost, of any other alternative. The platform inherently supports modern data architectures – data mesh, data fabric, and data hub – and deploys in cloud, on-premise, or hybrid environments.
  • 12
    DataGalaxy

    DataGalaxy

    DataGalaxy

    DataGalaxy is a next-generation data governance and intelligence platform designed to help organizations manage, understand, and maximize the value of their data. Built around a unified interface, it empowers everyone—from executives to data consumers—to collaborate seamlessly across data assets, strategies, and analytics. The platform’s automated data catalog, governance hub, and AI co-pilot reduce manual work while ensuring compliance and data quality across systems. With over 70+ integrations, including Snowflake, Databricks, Power BI, and AWS, DataGalaxy connects your data ecosystem into a single source of truth. Its value tracking center and strategy cockpit align data initiatives with business goals, driving measurable outcomes and enterprise-wide visibility. Loved by users, DataGalaxy turns governance into a strategic advantage for the modern enterprise.
  • 13
    Dataplex Universal Catalog
    Dataplex Universal Catalog is Google Cloud’s intelligent governance platform for data and AI artifacts. It centralizes discovery, management, and monitoring across data lakes, warehouses, and databases, giving teams unified access to trusted data. With Vertex AI integration, users can instantly find datasets, models, features, and related assets in one search experience. It supports semantic search, data lineage, quality checks, and profiling to improve trust and compliance. Integrated with BigQuery and BigLake, it enables end-to-end governance for both proprietary and open lakehouse environments. Dataplex Universal Catalog helps organizations democratize data access, enforce governance, and accelerate analytics and AI initiatives.
    Starting Price: $0.060 per hour
  • 14
    Collibra

    Collibra

    Collibra

    With a best-in-class catalog, flexible governance, continuous quality, and built-in privacy, the Collibra Data Intelligence Cloud is your single system of engagement for data. Support your users with a best-in-class data catalog that includes embedded governance, privacy and quality. Raise the grade, by ensuring teams can quickly find, understand and access data across sources, business applications, BI and data science tools in one central location. Give your data some much-needed privacy. Centralize, automate and guide workflows to encourage collaboration, operationalize privacy and address global regulatory requirements. Get the full story around your data with Collibra Data Lineage. Automatically map relationships between systems, applications and reports to provide a context-rich view across the enterprise. Hone in on the data you care about most and trust that it is relevant, complete and trustworthy.
  • 15
    Zaloni Arena
    End-to-end DataOps built on an agile platform that improves and safeguards your data assets. Arena is the premier augmented data management platform. Our active data catalog enables self-service data enrichment and consumption to quickly control complex data environments. Customizable workflows that increase the accuracy and reliability of every data set. Use machine-learning to identify and align master data assets for better data decisioning. Complete lineage with detailed visualizations alongside masking and tokenization for superior security. We make data management easy. Arena catalogs your data, wherever it is and our extensible connections enable analytics to happen across your preferred tools. Conquer data sprawl challenges: Our software drives business and analytics success while providing the controls and extensibility needed across today’s decentralized, multi-cloud data complexity.
  • 16
    Validio

    Validio

    Validio

    See how your data assets are used: popularity, utilization, and schema coverage. Get important insights about your data assets such as popularity, utilization, quality, and schema coverage. Find and filter the data you need based on metadata tags and descriptions. Get important insights about your data assets such as popularity, utilization, quality, and schema coverage. Drive data governance and ownership across your organization. Stream-lake-warehouse lineage to facilitate data ownership and collaboration. Automatically generated field-level lineage map to understand the entire data ecosystem. Anomaly detection learns from your data and seasonality patterns, with automatic backfill from historical data. Machine learning-based thresholds are trained per data segment, trained on actual data instead of metadata only.
  • 17
    Datameer

    Datameer

    Datameer

    Datameer revolutionizes data transformation with a low-code approach, trusted by top global enterprises. Craft, transform, and publish data seamlessly with no code and SQL, simplifying complex data engineering tasks. Empower your data teams to make informed decisions confidently while saving costs and ensuring responsible self-service analytics. Speed up your analytics workflow by transforming datasets to answer ad-hoc questions and support operational dashboards. Empower everyone on your team with our SQL or Drag-and-Drop to transform your data in an intuitive and collaborative workspace. And best of all, everything happens in Snowflake. Datameer is designed and optimized for Snowflake to reduce data movement and increase platform adoption. Some of the problems Datameer solves: - Analytics is not accessible - Drowning in backlog - Long development
  • 18
    Sifflet

    Sifflet

    Sifflet

    Automatically cover thousands of tables with ML-based anomaly detection and 50+ custom metrics. Comprehensive data and metadata monitoring. Exhaustive mapping of all dependencies between assets, from ingestion to BI. Enhanced productivity and collaboration between data engineers and data consumers. Sifflet seamlessly integrates into your data sources and preferred tools and can run on AWS, Google Cloud Platform, and Microsoft Azure. Keep an eye on the health of your data and alert the team when quality criteria aren’t met. Set up in a few clicks the fundamental coverage of all your tables. Configure the frequency of runs, their criticality, and even customized notifications at the same time. Leverage ML-based rules to detect any anomaly in your data. No need for an initial configuration. A unique model for each rule learns from historical data and from user feedback. Complement the automated rules with a library of 50+ templates that can be applied to any asset.
  • 19
    Tree Schema Data Catalog
    The essential tool for metadata management. Automatically populate your entire catalog in under 5 minutes! Data Discovery. Find the data you need anywhere within your data ecosystem from the database all the way down to the specific values for each field. Automatically document your data from existing data stores. First-class support for tabular and unstructured data. Automated data governance actions. Data Lineage. Explore your data lineage and understand where your data comes from and where it is going. View impact analysis of changes Find all up and downstream impacts. Visualize relationships and connections. API AccessNew. Manage your data lineage as code and keep your catalog up to date with the Tree Schema API. Integrate Data Lineage into CICD pipelines Capture values & descriptions within your code Analyze impact for breaking changes. Data Dictionary. Know the key terms and lingo that drive your business. Define the context and scope for keywords
    Starting Price: $99 per month
  • 20
    Accurity

    Accurity

    Accurity

    With Accurity, the all-in-one data intelligence platform, you get a company-wide understanding and complete trust in your data — speed up business-critical decision making, increase your revenue, reduce your costs, and ensure your company’s data compliance. Equipped with timely, relevant, and accurate data, you can successfully satisfy and engage with your customers, elevating your brand awareness and driving sales conversions. With everything accessible from a single interface, automated quality checks, and data quality issue workflows, you can lower personnel and infrastructure costs, and spend time utilizing your data rather than just managing it. Discover real value in your data by revealing and removing inefficiencies, improving your decision-making processes, and finding valuable product and customer information to boost your company’s innovation.
  • 21
    Ab Initio

    Ab Initio

    Ab Initio

    Data arrives from every direction, growing in scale and complexity. Hidden in the data is knowledge and insight that is full of potential. Such potential is only fully realized when it permeates through to every decision and action the organization takes, second by second. As the business changes, so does the data itself, resulting in new knowledge and insight. A cycle is formed, learn and adapt. Industries as far ranging as financial services, healthcare, telecommunications, manufacturing, transportation, and entertainment have recognized the opportunity. Getting there is both challenging and exciting. Success demands new levels of speed and agility in understanding, managing, and processing vast amounts of continuously changing data. Complex organizations require a high performance data platform that is built for automation and self-service, that thrives amid change and adapts to new realities, and that can solve the toughest data processing and data management challenges.
  • 22
    Ataccama ONE
    Ataccama reinvents the way data is managed to create value on an enterprise scale. Unifying Data Governance, Data Quality, and Master Data Management into a single, AI-powered fabric across hybrid and Cloud environments, Ataccama gives your business and data teams the ability to innovate with unprecedented speed while maintaining trust, security, and governance of your data.
  • 23
    ThinkData Works

    ThinkData Works

    ThinkData Works

    Data is the backbone of effective decision-making. However, employees spend more time managing it than using it. ThinkData Works provides a robust catalog platform for discovering, managing, and sharing data from both internal and external sources. Enrichment solutions combine partner data with your existing datasets to produce uniquely valuable assets that can be shared across your entire organization. Unlock the value of your data investment by making data teams more efficient, improving project outcomes, replacing multiple existing tech solutions, and providing you with a competitive advantage.
  • 24
    Data360 Govern
    Your organization knows the value of data and the need to get it into the hands of business users for maximum impact, but without enterprise data governance, that data might be hard to find, understand, and trust. Data360 Govern is an enterprise data governance, catalog, and metadata management solution that gives you confidence in the quality, value, and trustworthiness of your data. It automates governance and stewardship tasks to help you answer essential questions about your data’s source, use, meaning, ownership, and quality. With Data360 Govern, you can make faster decisions on data usage and management, build collaboration across your entire organization, and allow users to get the answers they need – when they need them. Transparency into your organization’s data landscape gives you the power to track the critical data aligned with your most important business outcomes.
  • 25
    Catalog

    Catalog

    Coalesce

    Catalog from Coalesce (formerly CastorDoc) is a data catalog designed for mass adoption across the whole company. Have an overview of all your data environment. Search for data instantly thanks to our powerful search engine. Onboard to a new data infrastructure and access data in a breeze. Go beyond your traditional data catalog. Modern data teams now have numerous data sources, build one truth. With its delightful and automated documentation experience, Catalog makes it dead simple to trust data. Column-level, cross-system data lineage in minutes. Get a bird’s eye view of your data pipelines to build trust in your data. Troubleshoot data issues, perform impact analyses, comply with GDPR in one tool. Optimize performance, cost, compliance, and security for your data. Keep your data stack healthy with our automated infrastructure monitoring system.
    Starting Price: $699 per month
  • 26
    Apache Atlas

    Apache Atlas

    Apache Software Foundation

    Atlas is a scalable and extensible set of core foundational governance services – enabling enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the whole enterprise data ecosystem. Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets and provide collaboration capabilities around these data assets for data scientists, analysts and the data governance team. Pre-defined types for various Hadoop and non-Hadoop metadata. Ability to define new types for the metadata to be managed. Types can have primitive attributes, complex attributes, object references; can inherit from other types. Instances of types, called entities, capture metadata object details and their relationships. REST APIs to work with types and instances allow easier integration.
  • 27
    Mozart Data

    Mozart Data

    Mozart Data

    Mozart Data is the all-in-one modern data platform that makes it easy to consolidate, organize, and analyze data. Start making data-driven decisions by setting up a modern data stack in an hour - no engineering required.
  • 28
    OvalEdge

    OvalEdge

    OvalEdge

    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organizations’ databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 29
    Talend Data Catalog
    Talend Data Catalog gives your organization a single, secure point of control for your data. With robust tools for search and discovery, and connectors to extract metadata from virtually any data source, Data Catalog makes it easy to protect your data, govern your analytics, manage data pipelines, and accelerate your ETL processes. Data Catalog automatically crawls, profiles, organizes, links, and enriches all your metadata. Up to 80% of the information associated with the data is documented automatically and kept up-to-date through smart relationships and machine learning, continually delivering the most current data to the user. Make data governance a team sport with a secure single point of control where you can collaborate to improve data accessibility, accuracy, and business relevance. Support data privacy and regulatory compliance with intelligent data lineage tracing and compliance tracking.
  • 30
    Blindata

    Blindata

    Blindata

    Blindata covers all the functions of a Data Governance program: Business Glossary, Data Catalog & Data Lineage build an integrated and complete view on your Data. Data Classification module gives a semantic meaning to the data while the Data Quality, Issue Management & Data Stewardship modules improve the reliability and trust on data. Moreover, privacy compliance can leverage specific features: registry of processing activities, centralized privacy note management, consent registry with Blockchain integrated notarization. Blindata Agent can connect to different data sources, collecting metadata such data structures (Tables, Views, Fields, …), data quality metrics, reverse lineage, etc. Blindata has a modular and entirely API based architecture allowing systematic integration with the most critical business systems (DBMS, Active Directory, e-commerce, Data Platforms). Blindata is available as SaaS, can be installed “on Premise” or purchased on AWS Marketplace.
    Starting Price: $1000/year/user
  • 31
    Secuvy AI
    Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via AI-driven workflows. Best in class data intelligence especially for unstructured data. Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via ai-driven workflows. Best in class data intelligence especially for unstructured data. Automated data discovery, customizable subject access requests, user validations, data maps & workflows for privacy regulations such as ccpa, gdpr, lgpd, pipeda and other global privacy laws. Data intelligence to find sensitive and privacy information across multiple data stores at rest and in motion. In a world where data is growing exponentially, our mission is to help organizations to protect their brand, automate processes, and improve trust with customers. With ever-expanding data sprawls we wish to reduce human efforts, costs & errors for handling Sensitive Data.
  • 32
    Select Star

    Select Star

    Select Star

    Set up your automated data catalog in just 15 minutes, and receive column-level lineage, Entity Relationship (ER) diagram, and auto-populated documentation within 24 hours. Easily find, tag, and add documentation to your data so everyone can find the right dataset for their use case. Select Star automatically detects and displays your column-level data lineage. You can now trust the data, knowing where it came from. Select Star automatically surfaces how your company uses data. That means you can identify relevant data fields without needing to ask someone else. Select Star treats your data with AICPA SOC 2 Security, Confidentiality, and Availability standards, making sure your data is always safe and sound.
    Starting Price: $270 per month
  • 33
    Rocket Data Intelligence
    Rocket® Data Intelligence (RDI) delivers comprehensive visibility into enterprise data across mainframe, distributed, and cloud environments. It automatically discovers metadata, lineage, and data relationships so organizations can see where critical data resides, how it moves, and which applications and processes rely on it. RDI supports legacy and modern platforms, including Db2, VSAM, IMS, Adabas, Datacom, relational databases, ETL tools like Informatica and DataStage, code such as COBOL, Python, and Java, and cloud data stores. RDI provides enterprise-grade capabilities including automated data discovery and code parsing, impact analysis, lineage filtering, role/LOB-based categorization and governance, workflow management, business glossary, and dependency mapping. By unifying data asset visibility across hybrid environments, RDI reduces operational risk and accelerates data modernization, compliance reporting, discovery, and rationalization initiatives.
  • 34
    Acryl Data

    Acryl Data

    Acryl Data

    No more data catalog ghost towns. Acryl Cloud drives fast time-to-value via Shift Left practices for data producers and an intuitive UI for data consumers. Continuously detect data quality incidents in real-time, automate anomaly detection to prevent breakages, and drive fast resolution when they do occur. Acryl Cloud supports both push-based and pull-based metadata ingestion for easy maintenance, ensuring information is trustworthy, up-to-date, and definitive. Data should be operational. Go beyond simple visibility and use automated Metadata Tests to continuously expose data insights and surface new areas for improvement. Reduce confusion and accelerate resolution with clear asset ownership, automatic detection, streamlined alerts, and time-based lineage for tracing root causes.
  • 35
    Dawiso

    Dawiso

    Dawiso

    Dawiso is your modern platform for managing and understanding data, built to unify governance and usability in a way that works for your entire organization. At its core is a powerful, AI-powered data catalog, enabling teams to quickly discover, interpret, and access trusted data across systems, reports, and business tools. With flexible governance features and business-friendly documentation apps, Dawiso bridges the gap between technical and non-technical users, fostering true collaboration. Enhance trust in your data with clear, visual data lineage that maps relationships across sources and systems, giving you full context and control. Support compliance through customizable workflows, role-based access, and structured metadata capture.
    Starting Price: $49 per user per month
  • 36
    Foundational

    Foundational

    Foundational

    Identify code and optimization issues in real-time, prevent data incidents pre-deploy, and govern data-impacting code changes end to end—from the operational database to the user-facing dashboard. Automated, column-level data lineage, from the operational database all the way to the reporting layer, ensures every dependency is analyzed. Foundational automates data contract enforcement by analyzing every repository from upstream to downstream, directly from source code. Use Foundational to proactively identify code and data issues, find and prevent issues, and create controls and guardrails. Foundational can be set up in minutes with no code changes required.
  • 37
    Kylo

    Kylo

    Teradata

    Kylo is an open source enterprise-ready data lake management software platform for self-service data ingest and data preparation with integrated metadata management, governance, security and best practices inspired by Think Big's 150+ big data implementation projects. Self-service data ingest with data cleansing, validation, and automatic profiling. Wrangle data with visual sql and an interactive transform through a simple user interface. Search and explore data and metadata, view lineage, and profile statistics. Monitor health of feeds and services in the data lake. Track SLAs and troubleshoot performance. Design batch or streaming pipeline templates in Apache NiFi and register with Kylo to enable user self-service. Organizations can expend significant engineering effort moving data into Hadoop yet struggle to maintain governance and data quality. Kylo dramatically simplifies data ingest by shifting ingest to data owners through a simple guided UI.
  • 38
    Global IDs

    Global IDs

    Global IDs

    Find out some of the best platform features provided by Global IDs, which brings in a set of Enterprise Data Solutions like data governance, data compliance, cloud migration, rationalization, privacy, analytics & much more! Global IDs EDA Platform feature comprises a set of core functions: automated discovery and profiling, data classification, data lineage, data quality and more – that render data transparent, trustworthy, and explainable across the ecosystem. Global IDs EDA platform architecture is designed for integration from the ground up with all platform functionality accessible via APIs. Global IDs EDA platform automates data management for enterprises of any size or data ecosystem.
  • 39
    Collate

    Collate

    Collate

    Collate is an AI‑driven metadata platform that empowers data teams with automated discovery, observability, quality, and governance through agent‑based workflows. Built on the open source OpenMetadata foundation and a unified metadata graph, it offers 90+ turnkey connectors to ingest metadata from databases, data warehouses, BI tools, and pipelines, delivering in‑depth column‑level lineage, data profiling, and no‑code quality tests. Its AI agents automate data discovery, permission‑aware querying, alerting, and incident‑management workflows at scale, while real‑time dashboards, interactive analyses, and a collaborative business glossary enable both technical and non‑technical users to steward high‑quality data assets. Continuous monitoring and governance automations enforce compliance with standards such as GDPR and CCPA, reducing mean time to resolution for data issues and lowering total cost of ownership.
    Starting Price: Free
  • 40
    datuum.ai
    AI-powered data integration tool that helps streamline the process of customer data onboarding. It allows for easy and fast automated data integration from various sources without coding, reducing preparation time to just a few minutes. With Datuum, organizations can efficiently extract, ingest, transform, migrate, and establish a single source of truth for their data, while integrating it into their existing data storage. Datuum is a no-code product and can reduce up to 80% of the time spent on data-related tasks, freeing up time for organizations to focus on generating insights and improving the customer experience. With over 40 years of experience in data management and operations, we at Datuum have incorporated our expertise into the core of our product, addressing the key challenges faced by data engineers and managers and ensuring that the platform is user-friendly, even for non-technical specialists.
  • 41
    Dataedo

    Dataedo

    Dataedo

    Discover, document and manage your metadata. Dataedo is equipped with multiple automated metadata scanners that connect to various database technologies, extract data structures and metadata, and load them into the metadata repository. With a few clicks, build a catalog of your data and describe each element. Decrypt table and column names with business-friendly aliases, provide meaning and purpose of data assets with descriptions and user-defined custom fields. Use sample data to learn what data is stored in your data assets. Understand the data better before using it and make sure that the data is good quality. Ensure high data quality with data profiling. Democratize access to knowledge about data. Build data literacy, democratize data and empower everyone in your organization to make better use of your data with a lightweight on-premises data catalog. Boost data literacy through a data catalog.
    Starting Price: $49 per month
  • 42
    erwin Data Intelligence
    erwin Data Intelligence (erwin DI) combines data catalog and data literacy capabilities for greater awareness of and access to available data assets, guidance on their use, and guardrails to ensure data policies and best practices are followed. Automatically harvest, transform and feed metadata from a wide array of data sources, operational processes, business applications and data models into a central catalog. Then make it accessible and understandable via role-based, contextual views so stakeholders can make strategic decisions based on accurate insights. erwin DI supports enterprise data governance, digital transformation and any effort that relies on data for favorable outcomes. Schedule ongoing scans of metadata from the widest array of data sources. Easily map data elements from source to target, including data in motion, and harmonize data integration across platforms. Enable data consumers to define and discover data relevant to their roles.
    Starting Price: $299 per month
  • 43
    Anzo

    Anzo

    Cambridge Semantics

    Anzo is a modern data discovery and integration platform that lets anyone find, connect and blend any enterprise data into analytics-ready datasets. Anzo’s unique use of semantics and graph data models makes it practical for the first time for virtually anyone in your organization – from skilled data scientists to novice business users – to drive the data discovery and integration process and build their own analytics-ready datasets. Anzo’s graph data models provide business users with a visual map of enterprise data that is easy to understand and navigate, even when your data is vast, siloed and complex. Semantics add business content to data, allowing users to harmonize data based on shared definitions and build blended, business-ready data on demand.
  • 44
    IBM watsonx.data integration
    IBM watsonx.data integration is a data integration platform designed to help organizations transform raw data into AI-ready data at scale. The platform enables data teams to build, manage, and optimize data pipelines across multiple environments, including on-premises systems and hybrid or multi-cloud infrastructures. With a unified control plane, watsonx.data integration supports multiple integration styles such as batch processing, real-time streaming, and data replication within a single solution. The platform also offers no-code, low-code, and pro-code development options, allowing both technical and non-technical users to design and manage data pipelines efficiently. By simplifying data integration workflows and reducing reliance on multiple tools, watsonx.data integration helps organizations deliver reliable data for analytics and AI applications.
  • 45
    Octopai

    Octopai

    Octopai

    Harness the power of data lineage, discovery and a data catalog to achieve full control of your data. that can instantly navigate through the most complex data landscapes. Gain access to the most comprehensive automated data lineage, discovery and data catalog. Providing unprecedented visibility and trust into the most complex data environments. Octopai extracts metadata from your entire data environment. With a quick, secure and simple process, Octopai will instantly be able to analyze the metadata. In one centralized platform Octopai allows you to access data lineage, data discovery and a data catalog, automatically. Trace any data end-to-end through your entire data landscape, in seconds. Automatically find the data you need anywhere in your data landscape. Create company-wide consistency with a self-creating, self-updating data catalog.
  • 46
    Talend Data Fabric
    Talend Data Fabric’s suite of cloud services efficiently handles all your integration and integrity challenges — on-premises or in the cloud, any source, any endpoint. Deliver trusted data at the moment you need it — for every user, every time. Ingest and integrate data, applications, files, events and APIs from any source or endpoint to any location, on-premise and in the cloud, easier and faster with an intuitive interface and no coding. Embed quality into data management and guarantee ironclad regulatory compliance with a thoroughly collaborative, pervasive and cohesive approach to data governance. Make the most informed decisions based on high quality, trustworthy data derived from batch and real-time processing and bolstered with market-leading data cleaning and enrichment tools. Get more value from your data by making it available internally and externally. Extensive self-service capabilities make building APIs easy— improve customer engagement.
  • 47
    Informatica Enterprise Data Catalog
    Scan and index metadata, discover and profile data, and provide detailed lineage across tens of millions of data sets. Classify and organize data assets across any environment to maximize data value and reuse. Automatically scan across multi-cloud platforms, BI tools, ETL, and third-party metadata catalogs; and data types. Leverage AI-powered domain discovery, data similarity, business term associations, and recommendations. Track data movement, from high-level system views to granular column-level lineage, and get detailed impact analysis. Use the Data Asset Analytics dashboard to understand asset usage, enrichment, and collaboration. View data quality rules, scorecards, metric groups, and profiling stats in context. Tap into shared data knowledge with certifications, ratings and reviews, a Q&A platform, and change notifications. Our broad and deep lineup of enterprise-grade data management solutions sets Informatica apart from the crowd.
  • 48
    CloverDX

    CloverDX

    CloverDX

    Design, debug, run and troubleshoot data transformations and jobflows in a developer-friendly visual designer. Orchestrate data workloads that require tasks to be carried out in the right sequence, orchestrate multiple systems with the transparency of visual workflows. Deploy data workloads easily into a robust enterprise runtime environment. In cloud or on-premise. Make data available to people, applications and storage under a single unified platform. Manage your data workloads and related processes together in a single platform. No task is too complex. We’ve built CloverDX on years of experience with large enterprise projects. Developer-friendly open architecture and flexibility lets you package and hide the complexity for non-technical users. Manage the entire lifecycle of a data pipeline from design, deployment to evolution and testing. Get things done fast with the help of our in-house customer success teams.
    Starting Price: $5000.00/one-time
  • 49
    Securiti

    Securiti

    Securiti

    Securiti is the pioneer of the Data Command Center, a centralized platform that enables the safe use of data and GenAI. It provides unified data intelligence, controls and orchestration across hybrid multicloud environments. Large global enterprises rely on Securiti's Data Command Center for data security, privacy, governance, and compliance. Securiti has been recognized with numerous industry and analyst awards, including "Most Innovative Startup" by RSA, "Top 25 Machine Learning Startups" by Forbes, "Most Innovative AI Companies" by CB Insights, "Cool Vendor in Data Security" by Gartner, and "Privacy Management Wave Leader" by Forrester. For more information, please follow us on LinkedIn and visit Securiti.ai.
  • 50
    Tokern

    Tokern

    Tokern

    Open source data governance suite for databases and data lakes. Tokern is a simple to use toolkit to collect, organize and analyze data lake's metadata. Run as a command-line app for quick tasks. Run as a service for continuous collection of metadata. Analyze lineage, access control and PII datasets using reporting dashboards or programmatically in Jupyter notebooks. Tokern is an open source data governance suite for databases and data lakes. Improve ROI of your data, comply with regulations like HIPAA, CCPA and GDPR and protect critical data from insider threats with confidence. Centralized metadata management of users, datasets and jobs. Powers other data governance features. Track Column Level Data Lineage for Snowflake, AWS Redshift and BigQuery. Build lineage from query history or ETL scripts. Explore lineage using interactive graphs or programmatically using APIs or SDKs.