55 Integrations with Astro by Astronomer
View a list of Astro by Astronomer integrations and software that integrates with Astro by Astronomer below. Compare the best Astro by Astronomer integrations as well as features, ratings, user reviews, and pricing of software that integrates with Astro by Astronomer. Here are the current Astro by Astronomer integrations in 2026:
-
1
Google Cloud Platform
Google
Google Cloud is a cloud-based service that allows you to create anything from simple websites to complex applications for businesses of all sizes. New customers get $300 in free credits to run, test, and deploy workloads. All customers can use 25+ products for free, up to monthly usage limits. Use Google's core infrastructure, data analytics & machine learning. Secure and fully featured for all enterprises. Tap into big data to find answers faster and build better products. Grow from prototype to production to planet-scale, without having to think about capacity, reliability or performance. From virtual machines with proven price/performance advantages to a fully managed app development platform. Scalable, resilient, high performance object storage and databases for your applications. State-of-the-art software-defined networking products on Google’s private fiber network. Fully managed data warehousing, batch and stream processing, data exploration, Hadoop/Spark, and messaging.Starting Price: Free ($300 in free credits) -
2
Zendesk
Zendesk
Zendesk is an AI-powered service solution that’s easy to set up, use, and scale. It works out-of-the-box and adapts quickly, enabling businesses to move faster. Built on billions of CX interactions, Zendesk AI supports the whole service journey—from self-service to agents to admins—helping teams resolve issues faster and operate efficiently at scale. Zendesk empowers agents with tools, insights, and context to deliver personalized service on any channel—social messaging, phone, or email. It unifies personalized conversations, omnichannel case management, AI workflows, automation, and a Marketplace of 1200+ apps. Easy to implement, it frees teams from relying on IT or costly partners. Serving over 130K global brands in 30+ languages, Zendesk simplifies business complexity to create meaningful customer connections. Headquartered in San Francisco, it operates worldwide.Starting Price: $25/agent/month -
3
Hightouch
Hightouch
Hightouch is the leading Composable CDP and AI Decisioning platform that empowers companies to activate their data warehouse to power personalized marketing and business operations. Trusted by leading organizations like PetSmart, Warner Music Group, Chime, Aritzia, and Whoop, Hightouch enables anyone to deliver personalized customer experiences, optimize performance marketing, and move faster by leveraging data and AI across their organization. Hightouch’s Composable Customer Data Platform collects behavioral events, resolves identities into Customer 360 profiles, builds audiences, syncs data to over 200 destinations like ad platforms and CRMs, and analyzes the results of marketing campaigns— all from the cloud data warehouse.Starting Price: $350 per month -
4
Salesforce
Salesforce
Put the power of the #1 CRM to work, at a price that works for you. Launch and grow quickly with the leading AI CRM built for small businesses in any industry - available through the Starter Suite or Pro Suite. Connect marketing, sales, service, and commerce on one easy platform. Save time with quick setup and smart guidance. Harness unified data and AI to fuel your growth. Start simple with Starter Suite — an all-in-one CRM for small businesses. Scale smoothly with AI agents, integrated data, and apps in one platform. No installation needed, just sign up and go from your browser. Advance further with Pro Suite, the complete CRM that scales with your business. Automate tasks and customize your tools to deepen customer relationships and drive growth.Starting Price: $25.00/month/user -
5
Docker
Docker
Docker takes away repetitive, mundane configuration tasks and is used throughout the development lifecycle for fast, easy and portable application development, desktop and cloud. Docker’s comprehensive end-to-end platform includes UIs, CLIs, APIs and security that are engineered to work together across the entire application delivery lifecycle. Get a head start on your coding by leveraging Docker images to efficiently develop your own unique applications on Windows and Mac. Create your multi-container application using Docker Compose. Integrate with your favorite tools throughout your development pipeline, Docker works with all development tools you use including VS Code, CircleCI and GitHub. Package applications as portable container images to run in any environment consistently from on-premises Kubernetes to AWS ECS, Azure ACI, Google GKE and more. Leverage Docker Trusted Content, including Docker Official Images and images from Docker Verified Publishers.Starting Price: $7 per month -
6
Kubernetes
Kubernetes
Kubernetes (K8s) is an open-source system for automating deployment, scaling, and management of containerized applications. It groups containers that make up an application into logical units for easy management and discovery. Kubernetes builds upon 15 years of experience of running production workloads at Google, combined with best-of-breed ideas and practices from the community. Designed on the same principles that allows Google to run billions of containers a week, Kubernetes can scale without increasing your ops team. Whether testing locally or running a global enterprise, Kubernetes flexibility grows with you to deliver your applications consistently and easily no matter how complex your need is. Kubernetes is open source giving you the freedom to take advantage of on-premises, hybrid, or public cloud infrastructure, letting you effortlessly move workloads to where it matters to you.Starting Price: Free -
7
Slack
Salesforce
Slack is a cloud-based project collaboration and team interaction software solution specially designed to seamlessly facilitate communication across organizations. Featuring powerful tools and services integrated into a single platform, Slack provides private channels to promote interaction within smaller teams, direct channels to help send messages directly to colleagues, and public channels that enables members across organizations to start conversations. Available on Mac, Windows, Android, and iOS apps, Slack offers a plethora of features that include chat, file sharing, collaborative workspace, real-time notifications, two-way audio and video, screen sharing, document imaging, activity tracking and logging, and more.Starting Price: $6.67 per user per month -
8
Asana
Asana
Asana is a collaborative work management platform that brings teams, projects, and goals together in one centralized workspace. It helps organizations plan, track, and deliver work more efficiently by connecting daily tasks to strategic objectives. With built-in AI capabilities, Asana reduces manual effort by automating updates, surfacing insights, and keeping work on track. The platform supports teams of any size, from small businesses to large enterprises, across a wide range of use cases. Asana integrates with over 300 popular business tools, making it easy to fit into existing workflows. Strong security, compliance options, and Asana Gov make it suitable for regulated industries and government agencies. Trusted by a majority of Fortune 100 companies, Asana is designed to scale as organizations grow.Starting Price: Free -
9
Discord
Discord
Discord is a free game communications app designed for both desktop and mobile platforms. Millions of players use the popular game platform every day to chat with friends over voice or text, or even stream gameplay in crystal clear quality for other Discord users. Not only can you organize a voice/text party in seconds, you can also use the service to find other players/teammates, search for certain types of groups/activities, or just talk games during your off time. The best part is that Discord is not designed for any specific genre or type of game; you can use it to coordinate communications for any game imaginable!Starting Price: Free -
10
PagerDuty
PagerDuty
PagerDuty, Inc. (NYSE:PD) is a leader in digital operations management. In an always-on world, organizations of all sizes trust PagerDuty to help them deliver a perfect digital experience to their customers, every time. Teams use PagerDuty to identify issues and opportunities in real time and bring together the right people to fix problems faster and prevent them in the future. PagerDuty's ecosystem of over 350+ integrations, including Slack, Zoom, ServiceNow, AWS, Microsoft Teams, Salesforce, and more, enable teams to centralize their technology stack, get a holistic view of their operations, and optimize processes within their toolsets. -
11
Jira
Atlassian
Jira is the only project management tool you need to plan and track work across every team. Jira by Atlassian is the #1 software development tool for teams planning and building great products. Trusted by thousands of teams, Jira offers access to a wide range of tools for planning, tracking, and releasing world-class software, capturing and organizing issues, assigning work, and following team activity. It also integrates with leading developer tools for end-to-end traceability. From short projects, to large cross-functional programs, Jira helps break big ideas down into achievable steps. Organize work, create milestones, map dependencies and more. Link work to goals so everyone can see how their work contributes to company objectives and stay aligned to what’s important. Your next move, suggested by AI. Atlassian Intelligence takes your big ideas and automatically suggests the tasks to help get it done.Starting Price: Free -
12
GitHub
GitHub
GitHub is the world’s most secure, most scalable, and most loved developer platform. Join millions of developers and businesses building the software that powers the world. Build with the world’s most innovative communities, backed by our best tools, support, and services. If you manage multiple contributors , there’s a free option: GitHub Team for Open Source. We also run GitHub Sponsors, where we help fund your work. The Pack is back. We’ve partnered up to give students and teachers free access to the best developer tools—for the school year and beyond. Work for a government-recognized nonprofit, association, or 501(c)(3)? Get a discounted Organization account on us.Starting Price: $7 per month -
13
MongoDB
MongoDB
MongoDB is a general purpose, document-based, distributed database built for modern application developers and for the cloud era. No database is more productive to use. Ship and iterate 3–5x faster with our flexible document data model and a unified query interface for any use case. Whether it’s your first customer or 20 million users around the world, meet your performance SLAs in any environment. Easily ensure high availability, protect data integrity, and meet the security and compliance standards for your mission-critical workloads. An integrated suite of cloud database services that allow you to address a wide variety of use cases, from transactional to analytical, from search to data visualizations. Launch secure mobile apps with native, edge-to-cloud sync and automatic conflict resolution. Run MongoDB anywhere, from your laptop to your data center.Starting Price: Free -
14
Telegram
Telegram
Telegram messages are heavily encrypted and can self-destruct. Telegram lets you access your messages from multiple devices. Telegram delivers messages faster than any other application. Telegram servers are spread worldwide for security and speed. Telegram has an open API and protocol free for everyone. Telegram is free forever. No ads. No subscription fees. Telegram servers are spread worldwide for security and speed. Telegram has an open API and protocol free for everyone. Telegram is free forever. No ads. No subscription fees. Telegram keeps your messages safe from hacker attacks. Telegram has no limits on the size of your media and chats. Help make messaging safe again – spread the word about Telegram. -
15
Amazon Web Services (AWS)
Amazon
Amazon Web Services (AWS) is the world’s most comprehensive cloud platform, trusted by millions of customers across industries. From startups to global enterprises and government agencies, AWS provides on-demand solutions for compute, storage, networking, AI, analytics, and more. The platform empowers organizations to innovate faster, reduce costs, and scale globally with unmatched flexibility and reliability. With services like Amazon EC2 for compute, Amazon S3 for storage, SageMaker for AI/ML, and CloudFront for content delivery, AWS covers nearly every business and technical need. Its global infrastructure spans 120 availability zones across 38 regions, ensuring resilience, compliance, and security. Backed by the largest community of customers, partners, and developers, AWS continues to lead the cloud industry in innovation and operational expertise. -
16
Microsoft Azure
Microsoft
Microsoft's Azure is a cloud computing platform that allows for rapid and secure application development, testing and management. Azure. Invent with purpose. Turn ideas into solutions with more than 100 services to build, deploy, and manage applications—in the cloud, on-premises, and at the edge—using the tools and frameworks of your choice. Continuous innovation from Microsoft supports your development today, and your product visions for tomorrow. With a commitment to open source, and support for all languages and frameworks, build how you want, and deploy where you want to. On-premises, in the cloud, and at the edge—we’ll meet you where you are. Integrate and manage your environments with services designed for hybrid cloud. Get security from the ground up, backed by a team of experts, and proactive compliance trusted by enterprises, governments, and startups. The cloud you can trust, with the numbers to prove it. -
17
Facebook
Meta
Facebook is the world's largest social network. We build technologies that help people connect with friends and family, find communities, and grow businesses. From fundraising to offering life-saving help in a Facebook post or signing up to donate blood, we’re inspired by the ways people show up for each other in times of need. The Facebook app helps you connect with friends, family and communities of people who share your interests. Connecting with your friends and family as well as discovering new ones is easy with features like Groups, Watch and Marketplace.Starting Price: Free -
18
Datadog
Datadog
Datadog is the monitoring, security and analytics platform for developers, IT operations teams, security engineers and business users in the cloud age. Our SaaS platform integrates and automates infrastructure monitoring, application performance monitoring and log management to provide unified, real-time observability of our customers' entire technology stack. Datadog is used by organizations of all sizes and across a wide range of industries to enable digital transformation and cloud migration, drive collaboration among development, operations, security and business teams, accelerate time to market for applications, reduce time to problem resolution, secure applications and infrastructure, understand user behavior and track key business metrics.Starting Price: $15.00/host/month -
19
Tableau
Salesforce
Tableau, now enhanced with AI-powered capabilities and integrated with Salesforce, is an advanced analytics platform that helps businesses turn data into actionable insights. With Tableau Next, users can unlock the full potential of their data by accessing trusted AI-driven analytics. Whether deployed in the cloud, on-premises, or natively within Salesforce CRM, Tableau enables seamless data integration, powerful visualizations, and collaboration. The platform is designed to support organizations of all sizes in making data-driven decisions, while fostering a Data Culture through easy-to-use, intuitive tools for analysts, business leaders, IT leaders, and developers alike.Starting Price: $75/user/month -
20
SQLite
SQLite
SQLite is a C-language library that implements a small, fast, self-contained, high-reliability, full-featured, SQL database engine. SQLite is the most used database engine in the world. SQLite is built into all mobile phones and most computers and comes bundled inside countless other applications that people use every day. SQLite is an in-process library that implements a self-contained, serverless, zero-configuration, transactional SQL database engine. The code for SQLite is in the public domain and is thus free for use for any purpose, commercial or private. SQLite is the most widely deployed database in the world with more applications than we can count, including several high-profile projects.Starting Price: Free -
21
MySQL
Oracle
MySQL is the world's most popular open source database. With its proven performance, reliability, and ease-of-use, MySQL has become the leading database choice for web-based applications, used by high profile web properties including Facebook, Twitter, YouTube, and all five of the top five websites*. Additionally, it is an extremely popular choice as embedded database, distributed by thousands of ISVs and OEMs.Starting Price: Free -
22
Opsgenie
Atlassian
Stay aware and in control of all Dev and Ops incidents. Notify the right people, reduce response time, and avoid alert fatigue. Opsgenie is a modern incident management platform that ensures critical incidents are never missed, and actions are taken by the right people in the shortest possible time. Opsgenie receives alerts from your monitoring systems and custom applications and categorizes each alert based on importance and timing. On-call schedules ensure the right people are notified through multiple communication channels including voice calls, email, SMS, and push messages on mobile devices. If an alert is not acknowledged, Opsgenie automatically escalates it, ensuring the incident gets the needed attention. Sign up for an instant free trial.Starting Price: $9 per user per month -
23
Snowflake
Snowflake
Snowflake is a comprehensive AI Data Cloud platform designed to eliminate data silos and simplify data architectures, enabling organizations to get more value from their data. The platform offers interoperable storage that provides near-infinite scale and access to diverse data sources, both inside and outside Snowflake. Its elastic compute engine delivers high performance for any number of users, workloads, and data volumes with seamless scalability. Snowflake’s Cortex AI accelerates enterprise AI by providing secure access to leading large language models (LLMs) and data chat services. The platform’s cloud services automate complex resource management, ensuring reliability and cost efficiency. Trusted by over 11,000 global customers across industries, Snowflake helps businesses collaborate on data, build data applications, and maintain a competitive edge.Starting Price: $2 compute/month -
24
SendGrid
Twilio SendGrid
Email is the backbone of your customer engagement. The Twilio SendGrid Email API is the email service trusted by developers and marketers for time-savings, scalability, and delivery expertise. Our flexible Email API and proprietary Mail Transfer Agent (MTA), intuitive console, powerful features, and email experts make it easy to ensure all your email gets delivered in seconds and without interruption. Monitor engagement data and optimize deliverability with our Deliverability Insights dashboard. Whether you’re sending 100 emails or 100 billion, SendGrid is built to scale. Integrate with our RESTful APIs and SMTP in minutes, with sample code libraries available in 7 languages, interactive docs, templates, and more.Starting Price: $19.95 per month -
25
Segment
Twilio
Twilio Segment’s Customer Data Platform (CDP) provides companies with the data foundation that they need to put their customers at the heart of every decision. Per IDC, it’s the #1 CDP in worldwide market share four years in a row (2019-2022). Using Twilio Segment, companies can collect, unify and route their customer data into any system where it’s needed to better understand their customers and create seamless, compelling experiences in real-time. Over 25,000 companies use Twilio Segment to make real-time decisions, accelerate growth and deliver world-class customer experiences.Starting Price: $120 per month -
26
SQL Server
Microsoft
Intelligence and security are built into Microsoft SQL Server 2019. You get extras without extra cost, along with best-in-class performance and flexibility for your on-premises needs. Take advantage of the efficiency and agility of the cloud by easily migrating to the cloud without changing code. Unlock insights and make predictions faster with Azure. Develop using the technology of your choice, including open source, backed by Microsoft's innovations. Easily integrate data into your apps and use a rich set of cognitive services to build human-like intelligence across any scale of data. AI is native to the data platform—you can unlock insights faster from all your data, on-premises and in the cloud. Combine your unique enterprise data and the world's data to build an intelligence-driven organization. Work with a flexible data platform that gives you a consistent experience across platforms and gets your innovations to market faster—you can build your apps and then deploy anywhere.Starting Price: Free -
27
Jenkins
Jenkins
The leading open source automation server, Jenkins provides hundreds of plugins to support building, deploying and automating any project. As an extensible automation server, Jenkins can be used as a simple CI server or turned into the continuous delivery hub for any project. Jenkins is a self-contained Java-based program, ready to run out-of-the-box, with packages for Windows, Linux, macOS and other Unix-like operating systems. Jenkins can be easily set up and configured via its web interface, which includes on-the-fly error checks and built-in help. With hundreds of plugins in the Update Center, Jenkins integrates with practically every tool in the continuous integration and continuous delivery toolchain. Jenkins can be extended via its plugin architecture, providing nearly infinite possibilities for what Jenkins can do. Jenkins can easily distribute work across multiple machines, helping drive builds, tests and deployments across multiple platforms faster. -
28
Redis
Redis Labs
Redis Labs: home of Redis. Redis Enterprise is the best version of Redis. Go beyond cache; try Redis Enterprise free in the cloud using NoSQL & data caching with the world’s fastest in-memory database. Run Redis at scale, enterprise grade resiliency, massive scalability, ease of management, and operational simplicity. DevOps love Redis in the Cloud. Developers can access enhanced data structures, a variety of modules, and rapid innovation with faster time to market. CIOs love the confidence of working with 99.999% uptime best in class security and expert support from the creators of Redis. Implement relational databases, active-active, geo-distribution, built in conflict distribution for simple and complex data types, & reads/writes in multiple geo regions to the same data set. Redis Enterprise offers flexible deployment options, cloud on-prem, & hybrid. Redis Labs: home of Redis. Redis JSON, Redis Java, Python Redis, Redis on Kubernetes & Redis gui best practices.Starting Price: Free -
29
Elasticsearch
Elastic
Elastic is a search company. As the creators of the Elastic Stack (Elasticsearch, Kibana, Beats, and Logstash), Elastic builds self-managed and SaaS offerings that make data usable in real time and at scale for search, logging, security, and analytics use cases. Elastic's global community has more than 100,000 members across 45 countries. Since its initial release, Elastic's products have achieved more than 400 million cumulative downloads. Today thousands of organizations, including Cisco, eBay, Dell, Goldman Sachs, Groupon, HP, Microsoft, Netflix, The New York Times, Uber, Verizon, Yelp, and Wikipedia, use the Elastic Stack, and Elastic Cloud to power mission-critical systems that drive new revenue opportunities and massive cost savings. Elastic has headquarters in Amsterdam, The Netherlands, and Mountain View, California; and has over 1,000 employees in more than 35 countries around the world. -
30
Apache Cassandra
Apache Software Foundation
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra's support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages. -
31
Apache Hive
Apache Software Foundation
The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive. Apache Hive is an open source project run by volunteers at the Apache Software Foundation. Previously it was a subproject of Apache® Hadoop®, but has now graduated to become a top-level project of its own. We encourage you to learn about the project and contribute your expertise. Traditional SQL queries must be implemented in the MapReduce Java API to execute SQL applications and queries over distributed data. Hive provides the necessary SQL abstraction to integrate SQL-like queries (HiveQL) into the underlying Java without the need to implement queries in the low-level Java API. -
32
Alibaba Cloud
Alibaba
As a business unit of Alibaba Group (NYSE: BABA), Alibaba Cloud provides a comprehensive suite of global cloud computing services to power both our international customers’ online businesses and Alibaba Group’s own e-commerce ecosystem. In January 2017, Alibaba Cloud became the official Cloud Services Partner of the International Olympic Committee. By harnessing, and improving on, the latest cloud technology and security systems, we tirelessly work towards our vision - to make it easier for you to do business anywhere, with anyone in the world. Alibaba Cloud provides cloud computing services for large and small businesses, individual developers, and the public sector in over 200 countries and regions. -
33
Yandex
Yandex
Traditionally, the task of any search engine was to find information on the internet. But now that role has expanded. The internet can no longer be seen as separate from the reality surrounding us, and search engines now have to look for all kinds of things, not just online but all over the offline world as well. And they don’t just look for things and find them – they also give helpful suggestions in any real-life situation. Contemporary search understands the desires of every individual user, as well as the reality in which the user exists. That’s why it can give suitable suggestions for each person, individually – including what to read, where to go for a meal, what music to listen to, how to get home in the fastest possible way, where to book the cheapest flights, and much more. Search always was and still remains Yandex’s core product, but now we have a broader understanding of its role: that of a universal aid, an assistant, a guide to everything that surrounds us.Starting Price: Free -
34
Plexus
Plexus Software
Flexible, customizable, and cloud-based, Plexus is a modern and mature business process and field management software solution designed to meet the needs of field service companies. Combining estimating, scheduling, and job costing capabilities with a robust set of features for field management, Plexus enables real-time capacity overviews that are crucial to communicating scheduling changes to employees as well as understanding staffing limits. Plus, the solution's invoicing and billing features also allow for a pogressive and collective invoicing of various job orders while delivering easy onsite payment capture and signature. -
35
Fivetran
Fivetran
Fivetran is a leading data integration platform that centralizes an organization’s data from various sources to enable modern data infrastructure and drive innovation. It offers over 700 fully managed connectors to move data automatically, reliably, and securely from SaaS applications, databases, ERPs, and files to data warehouses and lakes. The platform supports real-time data syncs and scalable pipelines that fit evolving business needs. Trusted by global enterprises like Dropbox, JetBlue, and Pfizer, Fivetran helps accelerate analytics, AI workflows, and cloud migrations. It features robust security certifications including SOC 1 & 2, GDPR, HIPAA, and ISO 27001. Fivetran provides an easy-to-use, customizable platform that reduces engineering time and enables faster insights. -
36
IBM Cloudant
IBM
IBM Cloudant® is a distributed database that is optimized for handling heavy workloads that are typical of large, fast-growing web and mobile apps. Available as an SLA-backed, fully managed IBM Cloud™ service, Cloudant elastically scales throughput and storage independently. Instantly deploy an instance, create databases and independently scale throughput capacity and data storage to meet your application requirements. Encrypt all data, with optional user-defined encryption key management through IBM Key Protect, and integrate with IBM Identity and Access Management. Get continuous availability as Cloudant distributes data across availability zones and 6 regions for app performance and disaster recovery requirements. Get continuous availability as Cloudant distributes data across availability zones and 6 regions for app performance and disaster recovery requirements. -
37
HashiCorp Vault
HashiCorp
Secure, store and tightly control access to tokens, passwords, certificates, encryption keys for protecting secrets and other sensitive data using a UI, CLI, or HTTP API. Secure applications and systems with machine identity and automate credential issuance, rotation, and more. Enable attestation of application and workload identity, using Vault as the trusted authority. Many organizations have credentials hard coded in source code, littered throughout configuration files and configuration management tools, and stored in plaintext in version control, wikis, and shared volumes. Safeguarding and ensuring that a credentials isn’t leaked, or in the likelihood it is, that the organization can quickly revoke access and remediate, is a complex problem to solve. -
38
Apache Druid
Druid
Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the 3 systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually, and only needs to read the ones needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values for fast search and filter. Out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid intelligently partitions data based on time and time-based queries are significantly faster than traditional databases. Scale up or down by just adding or removing servers, and Druid automatically rebalances. Fault-tolerant architecture routes around server failures. -
39
Datakin
Datakin
Instantly reveal the order hidden within your complex data world, and always know exactly where to look for answers. Datakin automatically traces data lineage, showing your entire data ecosystem in a rich visual graph. It clearly illustrates the upstream and downstream relationships for each dataset. The Duration tab summarizes a job’s performance in a Gantt-style chart along with its upstream dependencies, making it easy to find bottlenecks. When you need to pinpoint the exact moment of a breaking change, the Compare tab shows how your jobs and datasets have changed between runs. Sometimes jobs that run successfully produce bad output. The Quality tab surfaces critical data quality metrics, showing how they change over time so anomalies become obvious. Datakin helps you find the root cause of issues quickly – and prevent new ones from occurring.Starting Price: $2 per month -
40
Databricks Data Intelligence Platform
Databricks
The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker. -
41
Qubole
Qubole
Qubole is a simple, open, and secure Data Lake Platform for machine learning, streaming, and ad-hoc analytics. Our platform provides end-to-end services that reduce the time and effort required to run Data pipelines, Streaming Analytics, and Machine Learning workloads on any cloud. No other platform offers the openness and data workload flexibility of Qubole while lowering cloud data lake costs by over 50 percent. Qubole delivers faster access to petabytes of secure, reliable and trusted datasets of structured and unstructured data for Analytics and Machine Learning. Users conduct ETL, analytics, and AI/ML workloads efficiently in end-to-end fashion across best-of-breed open source engines, multiple formats, libraries, and languages adapted to data volume, variety, SLAs and organizational policies. -
42
Exasol
Exasol
With an in-memory, columnar database and MPP architecture, you can query billions of rows in seconds. Queries are distributed across all nodes in a cluster, providing linear scalability for more users and advanced analytics. MPP, in-memory, and columnar storage add up to the fastest database built for data analytics. With SaaS, cloud, on premises and hybrid deployment options you can analyze data wherever it lives. Automatic query tuning reduces maintenance and overhead. Seamless integrations and performance efficiency gets you more power at a fraction of normal infrastructure costs. Smart, in-memory query processing allowed this social networking company to boost performance, processing 10B data sets a year. A single data repository and speed engine to accelerate critical analytics, delivering improved patient outcome and bottom line. -
43
Neo4j
Neo4j
Neo4j’s graph data platform is purpose-built to leverage not only data but also data relationships. Using Neo4j, developers build intelligent applications that traverse today's large, interconnected datasets in real time. Powered by a native graph storage and processing engine, Neo4j’s graph database delivers an intuitive, flexible and secure database for unique, actionable insights. -
44
Apache Drill
The Apache Software Foundation
Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage -
45
PostgreSQL
PostgreSQL Global Development Group
PostgreSQL is a powerful, open-source object-relational database system with over 30 years of active development that has earned it a strong reputation for reliability, feature robustness, and performance. There is a wealth of information to be found describing how to install and use PostgreSQL through the official documentation. The open-source community provides many helpful places to become familiar with PostgreSQL, discover how it works, and find career opportunities. Learm more on how to engage with the community. The PostgreSQL Global Development Group has released an update to all supported versions of PostgreSQL, including 15.1, 14.6, 13.9, 12.13, 11.18, and 10.23. This release fixes 25 bugs reported over the last several months. This is the final release of PostgreSQL 10. PostgreSQL 10 will no longer receive security and bug fixes. If you are running PostgreSQL 10 in a production environment, we suggest that you make plans to upgrade. -
46
Presto
Presto Foundation
Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. For data engineers who struggle with managing multiple query languages and interfaces to siloed databases and storage, Presto is the fast and reliable engine that provides one simple ANSI SQL interface for all your data analytics and your open lakehouse. Different engines for different workloads means you will have to re-platform down the road. With Presto, you get 1 familar ANSI SQL language and 1 engine for your data analytics so you don't need to graduate to another lakehouse engine. Presto can be used for interactive and batch workloads, small and large amounts of data, and scales from a few to thousands of users. Presto gives you one simple ANSI SQL interface for all of your data in various siloed data systems, helping you join your data ecosystem together. -
47
Celery
Celery
Celery is a platform for pre-orders and beyond. People use Celery for pre-orders, custom crowdfunding, and e-commerce. Use it to win at presales with high-conversion checkout and checkout as a seamless extension of your brand. -
48
Apache Spark
Apache Software Foundation
Apache Spark™ is a unified analytics engine for large-scale data processing. Apache Spark achieves high performance for both batch and streaming data, using a state-of-the-art DAG scheduler, a query optimizer, and a physical execution engine. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python, R, and SQL shells. Spark powers a stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. You can combine these libraries seamlessly in the same application. Spark runs on Hadoop, Apache Mesos, Kubernetes, standalone, or in the cloud. It can access diverse data sources. You can run Spark using its standalone cluster mode, on EC2, on Hadoop YARN, on Mesos, or on Kubernetes. Access data in HDFS, Alluxio, Apache Cassandra, Apache HBase, Apache Hive, and hundreds of other data sources. -
49
Census
Census
Census is the operational analytics platform that syncs your data warehouse with all your favorite apps. Get your customer success, sales & marketing teams on the same page by keeping customer data in sync. No engineering favors required. Census automatically publishes SQL & dbt models from your warehouse without a line of code. Stop deciphering external APIs and maintaining custom scripts, just focus on the business outcomes. You don't want "yet another source of truth" when everything is already in your warehouse. Census works on top of your existing infrastructure. Just pick a destination app, map the data, and voila, you are done. Your data shouldn't be stuck in quarterly reports. Census empowers everyone in your business to take action. Live metrics in every app means better business operations, which leads to happier users & more revenue. -
50
lakeFS
Treeverse
lakeFS enables you to manage your data lake the way you manage your code. Run parallel pipelines for experimentation and CI/CD for your data. Simplifying the lives of engineers, data scientists and analysts who are transforming the world with data. lakeFS is an open source platform that delivers resilience and manageability to object-storage based data lakes. With lakeFS you can build repeatable, atomic and versioned data lake operations, from complex ETL jobs to data science and analytics. lakeFS supports AWS S3, Azure Blob Storage and Google Cloud Storage (GCS) as its underlying storage service. It is API compatible with S3 and works seamlessly with all modern data frameworks such as Spark, Hive, AWS Athena, Presto, etc. lakeFS provides a Git-like branching and committing model that scales to exabytes of data by utilizing S3, GCS, or Azure Blob for storage.