Ingest data from SQL Server, Salesforce, and Workday with LakeFlow Connect (2024)

We’re excited to announce the Public Preview of LakeFlow Connect for SQL Server, Salesforce, and Workday. These ingestion connectors enable simple and efficient ingestion from databases and enterprise apps—powered by incremental data processing and smart optimizations under the hood. LakeFlow Connect is also native to the Data Intelligence Platform, so it offers both serverless compute and Unity Catalog governance. Ultimately, this means organizations can spend less time moving their data and more time getting value from it.

More broadly, this is a key step towards realizing the future of data engineering on Databricks with LakeFlow: the unified solution for ingestion, transformation and orchestration that we announced at Data + AI Summit. LakeFlow Connect will work seamlessly with LakeFlow Pipelines for transformation and LakeFlow Jobs for orchestration. Together, these will enable customers to deliver fresher and higher-quality data to their businesses.

Challenges in data ingestion

Organizations have a wide range of data sources: enterprise apps, databases, message buses, cloud storage, and more. To address the nuances of each source, they often build and maintain custom ingestion pipelines, which introduces several challenges.

  • Complex configuration and maintenance: It’s difficult to connect to databases, especially without impacting the source system. It’s also hard to learn and keep up with ever-changing application APIs. Therefore, custom pipelines require a lot of effort to build, optimize, and maintain—which can, in turn, limit performance and increase costs.
  • Dependencies on specialized teams: Given this complexity, ingestion pipelines often require highly skilled data engineers. This means that data consumers (e.g., HR analysts, and financial planners) depend on specialized engineering teams, thus limiting productivity and innovation.
  • Patchwork solutions with limited governance: With a patchwork of pipelines, it’s hard to build governance, access control, observability, and lineage. This opens the door to security risks and compliance challenges, as well as difficulties in troubleshooting any issues.

LakeFlow Connect: simple and efficient ingestion for every team

LakeFlow Connect addresses these challenges so that any practitioner can easily build incremental data pipelines at scale.

LakeFlow Connect is simple to configure and maintain

To start, the connectors take as little as just a few steps to set up. Moreover, once you’ve set up a connector, it’s fully managed by Databricks. This lowers the costs of maintenance. It also means that ingestion no longer requires specialized knowledge—and that data can be democratized across your organization.

Ingest data from SQL Server, Salesforce, and Workday with LakeFlow Connect (1)

“The Salesforce connector was simple to set up and provides the ability to sync data to our data lake. This has saved a great deal of development time and ongoing support time making our migration faster”

— Martin Lee, Technology Lead Software Engineer, Ruffer

LakeFlow Connect is efficient

Under the hood, LakeFlow Connect pipelines are built on Delta Live Tables, which are designed for efficient incremental processing. Moreover, many of the connectors read and write only the data that’s changed in the source system. Finally, we leverage Arcion’s source-specific technology to optimize each connector for performance and reliability while also limiting impact on the source system.

Because ingestion is just the first step, we don’t stop there. You can also construct efficient materialized views that incrementally transform your data as it works its way through the medallion architecture. Specifically, Delta Live Tables can process updates to your views incrementally—only updating the rows that need to change rather than fully recomputing all rows. Over time, this can significantly improve the performance of your transformations, which in turn makes your end-to-end ETL pipelines just that much more efficient.

"The connector enhances our ability to transfer data by providing a seamless and robust integration between Salesforce and Databricks. [...] The time required to extract and prepare data has been reduced from approximately 3 hours to just 30 minutes”

— Amber Howdle-Fitton, Data and Analytics Manager, Kotahi

LakeFlow Connect is native to the Data Intelligence Platform

LakeFlow Connect is fully integrated with the rest of your Databricks tooling. Like the rest of your data and AI assets, it's governed by Unity Catalog, powered by Delta Live Tables using serverless compute, and orchestrated with Databricks Workflows. This enables features like unified monitoring across your ingestion pipelines. Moreover, because it’s all part of the same platform, you can then use Databricks SQL, AI/BI and Mosaic AI to get the most out of your data.

​​"With Databricks' new LakeFlow Connector for SQL Server, we can eliminate [...] intermediary products between our source database and Databricks. This means faster data ingestion, reduced costs, and less effort spent configuring, maintaining, and monitoring third-party CDC solutions. This feature will greatly benefit us by streamlining our data pipeline."

— Kun Lee, Senior Director Database Administrator, CoStar

An exciting LakeFlow roadmap

The first wave of connectors can create SQL Server, Salesforce, and Workday pipelines via API. But this Public Preview is only the beginning. In the coming months, we plan to begin Private Previews of connectors to additional data sources, such as:

  • ServiceNow
  • Google Analytics 4
  • SharePoint
  • PostgreSQL
  • SQL Server on-premises

The roadmap also includes a deeper feature set for each connector. This may include:

  • UI for connector creation
  • Data lineage
  • SCD type 2
  • Robust schema evolution
  • Data sampling

More broadly, LakeFlow Connect is only the first component of LakeFlow. Later this year, we plan to preview LakeFlow Pipelines for transformation and LakeFlow Jobs for orchestration—the evolution of Delta Live Tables and Workflows, respectively. Once they’re available, they will not require any migration. The best way to prepare for these new additions is to start using Delta Live Tables and Workflows today.

Getting started with LakeFlow Connect

SQL Server connector: Supports ingestion from Azure SQL Database and AWS RDS for SQL Server, with incremental reads that use change data capture (CDC) and change tracking technology. Learn more about the SQL Server Connector.

Salesforce connector: Supports ingestion from Salesforce Sales Cloud, allowing you to join these CRM insights with data in the Data Intelligence Platform to deliver additional insights and more accurate predictions. Learn more about the Salesforce connector.

Workday connector: Supports ingestion from Workday Reports-as-a-Service (RaaS), allowing you to analyze and enrich your reports. Learn more about the Workday connector.

"The Salesforce connector provided in LakeFlow Connect has been crucial for us, enabling direct connections to our Salesforce databases and eliminating the need for an additional paid intermediate service."

— Amine Hadj-Youcef, Solution Architect, Engie

To get access to the preview, contact your Databricks account team.

Note that LakeFlow Connect uses serverless compute for Delta Live Tables. Therefore:

  • Serverless compute must be enabled in your account (see how to do so for Azure or AWS, and see a list of serverless-enabled regions for Azure or AWS)
  • Your workspace must be enabled for Unity Catalog.

For further guidance, refer to the LakeFlow Connect documentation.

Ingest data from SQL Server, Salesforce, and Workday with LakeFlow Connect (2024)

References

Top Articles
|23 SKINS|Jack O'sassin|Meow Skulls|Indiana Jones|Geralt of Rivia... | ID 214091627 | PlayerAuctions
Guide to Rockford Region Golf Courses
Creepshotorg
Navicent Human Resources Phone Number
Bubble Guppies Who's Gonna Play The Big Bad Wolf Dailymotion
Nyu Paralegal Program
Blanchard St Denis Funeral Home Obituaries
Jennette Mccurdy And Joe Tmz Photos
Academic Integrity
Craigslist Nj North Cars By Owner
Nikki Catsouras Head Cut In Half
Visustella Battle Core
Facebook Marketplace Charlottesville
Beau John Maloney Houston Tx
Nebraska Furniture Tables
WEB.DE Apps zum mailen auf dem SmartPhone, für Ihren Browser und Computer.
Walgreens San Pedro And Hildebrand
Crawlers List Chicago
How many days until 12 December - Calendarr
Winco Employee Handbook 2022
Craigslist St. Cloud Minnesota
2487872771
Sienna
Kabob-House-Spokane Photos
O'reilly's In Mathis Texas
Encore Atlanta Cheer Competition
Big Boobs Indian Photos
Gesichtspflege & Gesichtscreme
Martins Point Patient Portal
Revelry Room Seattle
Donald Trump Assassination Gold Coin JD Vance USA Flag President FIGHT CIA FBI • $11.73
Gina's Pizza Port Charlotte Fl
The Pretty Kitty Tanglewood
Weekly Math Review Q4 3
W B Crumel Funeral Home Obituaries
Covalen hiring Ai Annotator - Dutch , Finnish, Japanese , Polish , Swedish in Dublin, County Dublin, Ireland | LinkedIn
Missouri State Highway Patrol Will Utilize Acadis to Improve Curriculum and Testing Management
Planet Fitness Santa Clarita Photos
Sc Pick 4 Evening Archives
Daly City Building Division
Registrar Lls
Isabella Duan Ahn Stanford
Sand Castle Parents Guide
Santa Clara County prepares for possible ‘tripledemic,’ with mask mandates for health care settings next month
Unitedhealthcare Community Plan Eye Doctors
2294141287
DL381 Delta Air Lines Estado de vuelo Hoy y Historial 2024 | Trip.com
Craigslist Cars For Sale By Owner Memphis Tn
Diesel Technician/Mechanic III - Entry Level - transportation - job employment - craigslist
Craigslist Centre Alabama
7 National Titles Forum
Dr Seuss Star Bellied Sneetches Pdf
Latest Posts
Article information

Author: Horacio Brakus JD

Last Updated:

Views: 5646

Rating: 4 / 5 (51 voted)

Reviews: 90% of readers found this page helpful

Author information

Name: Horacio Brakus JD

Birthday: 1999-08-21

Address: Apt. 524 43384 Minnie Prairie, South Edda, MA 62804

Phone: +5931039998219

Job: Sales Strategist

Hobby: Sculling, Kitesurfing, Orienteering, Painting, Computer programming, Creative writing, Scuba diving

Introduction: My name is Horacio Brakus JD, I am a lively, splendid, jolly, vivacious, vast, cheerful, agreeable person who loves writing and wants to share my knowledge and understanding with you.