Who Are the Candidates for the All-In-One Data Stack Platforms of the Future?
Inspiration from Seattle Data Guy (who now lives in Denver)
Image source: Incorta
Hey Guys,
This is Data Science Learning Premium,
Benjamin Rogojan, the “Seattle Data Guy”, is becoming an increasingly good source of information for data engineers and data scientists. He also has a stellar YouTube channel.
A recent LinkedIn post of his really piqued my interest.
On the Modern Data Stack
This is a direct quote:
Piecing together a “modern data stack” is far from the only option.
The “all-in-one data stack” is starting to come back into vogue. I have discussed some of these tools both with clients as well as VCs who have been paying attention to the recent customers who are being acquired by these new all-in-one data solutions.
In my newsletter this week I will be diving a little deeper in some of these options including Rivery, Keebola, Mozart Data, Nexla and Incorta.
Is the Era of All-In-One Data Stack Coming?
He cites companies such as Rivery, Keboola, Mozart Data, Nexla and Incorta. He even created a graph about them:
I’m very interested in the new data platforms that are coming. I’m forever thinking of how Snowflake, Databricks, Qubole, Vertica Systems, Datadog and so many others will evolve.
So what about the new breed of all-in-one data stack companies?
The Modern Data Stack (MDS) is centered around an ecosystem of tools businesses use to collect, move, store, transform, analyze, and operationalize their data.
Ben in his LinkedIn post goes on:
The modern data stack was all the rage in 2020 and 2021, but in the late parts of 2021 and most of 2022 people started to ask questions.
Is the modern data stack even modern?
Isn’t it just a piecemeal of components from solutions we have known forever like SAP or Informatica.
Isn’t it just an unbundled version of Airflow?
Clearly there is a sense that how companies use data and do machine learning will keep transforming even as digital transformation, software and A.I. continue to impact society, businesses and how we become a data-based civilization.
Let’s quickly review the companies Ben has identified ordered by their current funding levels:
Incorta
Incorta provides a unified data and analytics platform that makes it quick and easy to unlock the full potential of data from multiple complex source systems. By eliminating traditional data transformation, modeling and aggregation steps, it makes 100% of data instantly ready for analysis. Pre-packaged data apps for common business applications make it easy to prepare data for analysis.
It’s a startup founded by former Oracle executives who want to change the way we process large amounts of data.
Founded: 2013
CEO: Osama Elkady
A typical data project involves ETL (extract, transform, load): a process that takes data out of one database, changes the data to make it compatible with the target database, and adds it to the target database. Incorta's pitch is to eliminate much of that transformation and modeling work. A lead venture capitalist said: “Incorta is poised to upend the data warehousing market with innovative technology that will end 30 years of archaic and slow data warehouse infrastructure.”
Enable true business agility with a ludicrously fast and flexible data pipeline that makes 100% of your data from multiple data sources instantly ready for analysis with Incorta. You can go from zero to analyzing data in days or just a few weeks. Simply load your data and start exploring.
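To make the ETL steps mentioned above concrete, here is a minimal sketch in Python using two in-memory SQLite databases. The table names, column names and sample rows are all made up for illustration, and this is not how Incorta or any other vendor in this post implements its pipeline. It only shows the classic extract, transform, load pattern that these platforms want to simplify or remove.

```python
import sqlite3


def extract(source):
    """Extract: read raw rows out of the source database."""
    return source.execute("SELECT id, name, amount_cents FROM orders").fetchall()


def transform(rows):
    """Transform: reshape rows to fit the target schema.

    Names are trimmed and upper-cased, and cents are converted to dollars.
    """
    return [(row_id, name.strip().upper(), cents / 100) for row_id, name, cents in rows]


def load(target, rows):
    """Load: write the transformed rows into the target database."""
    target.executemany(
        "INSERT INTO orders_clean (id, customer, amount_usd) VALUES (?, ?, ?)", rows
    )
    target.commit()


def run_etl():
    # Hypothetical source system with messy data.
    source = sqlite3.connect(":memory:")
    source.execute("CREATE TABLE orders (id INTEGER, name TEXT, amount_cents INTEGER)")
    source.executemany(
        "INSERT INTO orders VALUES (?, ?, ?)", [(1, " alice ", 1250), (2, "bob", 300)]
    )

    # Hypothetical target warehouse with its own schema.
    target = sqlite3.connect(":memory:")
    target.execute("CREATE TABLE orders_clean (id INTEGER, customer TEXT, amount_usd REAL)")

    load(target, transform(extract(source)))
    return target.execute(
        "SELECT id, customer, amount_usd FROM orders_clean ORDER BY id"
    ).fetchall()


if __name__ == "__main__":
    print(run_etl())  # [(1, 'ALICE', 12.5), (2, 'BOB', 3.0)]
```

Even in a toy like this you can see where the work piles up: every new source needs its own extract and transform code, which is the maintenance burden the all-in-one vendors say they take off your hands.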
Rivery
Unify your ELT pipelines, workflow orchestration, & data operations with Rivery's complete SaaS platform.
Whether you’re building out your data stack or transitioning to the cloud, managing your data workflows to analyze your business can be a real challenge.
Developing an in-house solution requires valuable resources and upkeep, while integrating several tools adds new layers of complexity.
Rivery’s SaaS platform provides a fully-managed solution for data ingestion, data transformation, data orchestration, reverse ETL and more, with built-in support for your data operations development and deployment lifecycles.
Designed to be nimble for non-technical users and with advanced capabilities for experts, Rivery enables you to manage data workflows as the foundation of a modern data stack.
Founded: 2018 (New York with strong Israeli workforce presence)
CEO: Itamar Ben Hemo
Discover how Rivery enables data innovators from different industries to save time, automate processes, and unlock greater value from their data.
Rivery considers itself a DataOps platform: a SaaS product that gives companies control over their organizational data through the ingestion, transformation, and orchestration of data processes. It was named “one to watch” in Snowflake’s Modern Marketing Data Stack report.
Mozart Data
The modern data platform empowering anyone to easily centralize, organize, & analyze their data without engineering.
Mozart Data is the fastest way to set up scalable, reliable data infrastructure that doesn’t need to be maintained by you. Mozart Data’s all-in-one modern data platform empowers anyone to easily centralize, organize, and analyze their data without engineering resources.
Setting up a stack of data tools is a difficult exercise for any startup to undertake, involving picking and choosing among a wide variety of tools and approaches. Mozart Data’s founders have a background in data wrangling and have built a platform of tools that makes the decisions for you, especially designed for startups.
Being data-driven has never been easier
Your full-service modern data platform for centralizing, organizing, and analyzing your data. Get set up in an hour and start working with our data analysts.
In some cases Mozart has built the pieces into the stack itself, while in others it recommends what it considers best-in-class, as it has done by selecting Snowflake as the default choice for a data lake on the Mozart platform.
I’m a bit excited by Mozart Data’s no-code side of things. Say goodbye to exporting CSVs of data and building countless pivot tables. Connect your data sources and pull data automatically with reliable no-code integrations.
Founded: 2020
CEO: Peter Fishman
Their unique value proposition seems really useful. The idea is to have a data stack in a box of sorts that removes the complexity for startups that just want to get going quickly with a data platform.
Nexla
Nexla is a unified data operations company. It is a leader in the space and was named a 2021 Gartner Cool Vendor.
Their platform makes it simple for anyone to create scalable data flows. Teams working with data get a no/low-code unified experience to integrate, transform, provision, and monitor data for any use case. Data users with varying skill levels work collaboratively to create ready to use data products. Organizations get zero-friction, governed, and agile data operations.
Nexla’s initial customers have been larger, data-heavy companies such as DoorDash. The goal of its recent funding is to expand the market by offering the platform to smaller organizations.
Founded: 2016
CEO: Saket Saurabh
DATA ENGINEERING AUTOMATION
Ready-to-Use Data for Everyone
Reliable, consistent, and trusted data products in the hands of every user, in the applications they use.
It’s never been easier to create and run high-performance, automated data operations for your company. With their Data Product approach anyone can discover, integrate, transform, deliver, and monitor all data, regardless of format and velocity – batch, streaming, real-time.
Unifying integration, preparation, and monitoring whether you need ETL, ELT, Reverse ETL, API Integration, or Streaming. CLI, SDK, and API for developers. No-code for everyone else.
Automated connectors and data products. No code, no wait. We start at connectors that auto-generate from configuration, and end with ready-to-use data products that bring data into the application you use in a format you need.
The market, particularly in machine learning, has been heating up, and the company has been seeing traction with customers like DoorDash, Instacart, Poshmark and Freshworks, among others.
Keboola
They are a data stack as a service company.
Keboola is a Prague-based cloud data platform that helps clients combine, enhance and publish crucial information for their internal analytics projects and data products quickly and easily.
Having operated and served clients for 8 years, Keboola helps a wide array of businesses in the financial, travel, hospitality, retail and gaming industries significantly reduce or eliminate:
• Time spent on repetitive maintenance tasks
• Adoption time and learning curves needed for outdated systems
• Drawn-out menial responsibilities which detract from efficiency
Building on programming knowledge that is widely available in the market, such as SQL, R and Python, the company says it lets clients achieve an unparalleled time-to-value ratio with all Keboola Connection implementations. The majority of its customers are completely self-serving from the inception of their project.
Keboola partners with professional services companies to build the right solutions for its clients. Its developer partners build apps that integrate their services and algorithms into the platform, making functions like predictive analytics and machine learning instantly available to customers with no integration work required. It also forms technology alliances with platforms that help clients consume the data and insights in the ways most suitable for their particular use cases.
Founded: 2008
CEO: Pavel Doležal
Keboola runs a complete data stack as a service, all from one place. Everything works seamlessly together and the process is fully observable.
Automate and analyze
Keboola enables data engineers, data analysts, and analytics engineers to collaborate on analytics and automation as a team, from extraction, transformation, data management, and pipeline orchestration to reverse ETL.
Anyways guys, I hope you enjoyed the brief summary. I’m going to start digging into these kinds of companies more.
Complementary to this post is Seattle Data Guy's original and more technical review: https://seattledataguy.substack.com/p/the-next-generation-of-all-in-one