Delta Lake’s cover photo
Delta Lake

Delta Lake

Software Development

Delta Lake is an open-source storage framework that enables building a Lakehouse architecture.

About us

Delta Lake is an open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python. Delta Lake is an independent open-source project and not controlled by any single company. To emphasize this we joined the Delta Lake Project in 2019, which is a sub-project of the Linux Foundation Projects.

Website
https://delta.io
Industry
Software Development
Company size
11-50 employees
Headquarters
San Francisco
Type
Partnership
Founded
2019
Specialties
Delta Lake, Apache Spark, PrestoDB, Trino, Hive, Apache Flink, Apache Beam, Apache Pulsar, Rust, Scala, Java, Python, and Ruby

Locations

Employees at Delta Lake

Updates

  • 📣 Join us for our next 𝗢𝗽𝗲𝗻 𝗟𝗮𝗸𝗲𝗵𝗼𝘂𝘀𝗲 + 𝗔𝗜 webinar on October 30 at 9AM PT! Most lineage graphs are overly complex and lack meaningful context, and years of chasing “total visibility” proved that more metadata doesn’t create more clarity. 🔎 Collecting metadata isn’t the problem—understanding it is. This session explains why the future of observability isn’t another graph, but a reasoning layer that connects lineage and telemetry into one continuous context—so teams can detect, diagnose, and adapt in real time. Register ➡️ https://lnkd.in/eihgBzdV 🎙️ Featuring Willy Lulciuc, Co-Founder & CEO of oleander 💬 Hosted by Lisa N. Cao, Developer Relations at Databricks #opensource #metadata #openlakehouse #ai #datalineage #oss

    This content isn’t available here

    Access this content and more in the LinkedIn app

  • Come learn, connect, and celebrate the open source community in Mountain View on Nov 13 at the 𝗢𝗽𝗲𝗻 𝗟𝗮𝗸𝗲𝗵𝗼𝘂𝘀𝗲 + 𝗔𝗜 𝗠𝗶𝗻𝗶 𝗦𝘂𝗺𝗺𝗶𝘁! 🚀 Explore AI infrastructure, interoperable data systems, and more — plus lunch, swag, and great conversations. 🔗 RSVP on Luma: https://luma.com/OLMS-1113 #opensource #oss #deltalake #unitycatalog #apachespark #apacheiceberg #ai

    View organization page for Unity Catalog

    13,636 followers

    The open source and data community is coming together in Mountain View, CA! 💥 Join us Nov 13 (12–4:30PM PT) for the 𝗢𝗽𝗲𝗻 𝗟𝗮𝗸𝗲𝗵𝗼𝘂𝘀𝗲 + 𝗔𝗜 𝗠𝗶𝗻𝗶 𝗦𝘂𝗺𝗺𝗶𝘁, featuring two tracks packed with insights on AI infrastructure, context engineering, and the future of interoperable data systems.  Lunch, swag, and great conversations included! Stick around for the Apache Spark Happy Hour (5–6:30PM PT) — the perfect way to wrap up a day of learning and community. 😎 🎟️ RSVP here: https://luma.com/OLMS-1113 #opensource #oss #unitycatalog #apachespark #deltalake #apacheiceberg #ai

    • No alternative text description for this image
  • One of the most powerful new capabilities in Delta Lake 4.0 is support for collations, giving you much finer control over how text is compared and sorted. ✅ In this clip, Youssef Mrini explains how to enable collations: 🔹 Define a default collation at the table level 🔹 Customize collations for individual columns 🔹 Use ALTER TABLE or ALTER COLUMN to add them to existing tables Whether you’re handling multilingual datasets or need case-insensitive searches, collations make your queries more accurate and flexible — with minimal changes to your code. 🎥 Watch the full webinar to explore more: https://lnkd.in/eKgJ3tXM #opensource #oss #deltalake #linuxfoundation Scott Haines

  • One week to go! 🎉 Curious how data engineering design patterns translate to Delta Lake, and how Delta fits into streaming architectures? Join Scott Haines (Buf) and Bartosz Konieczny (waitingforcode.com) for a practical walkthrough of the patterns, features, and best practices powering reliable streaming with Delta Lake. 🗓️ Tuesday, Oct 14 🕝 9:00AM PT 🔗 Register: https://lnkd.in/eSRiXpDF #DeltaLake #opensource #oss #dataarchitecture #dataengineering

    View organization page for Delta Lake

    65,161 followers

    Are you wondering if general concepts like data engineering design patterns can help you learn about #DeltaLake? Or, if it's possible to leverage Delta Lake within your streaming data architecture? In this webinar, Scott Haines and Bartosz Konieczny will answer these two questions. Scott, who gained streaming expertise at Yahoo, Twilio, and Nike, will share with you best practices for leveraging Delta Lake as a component of your streaming architecture. ✅ Bartosz, who recently published Data Engineering Design Patterns, will reverse-engineer a few of these design patterns to explain which Delta Lake features make everything tick. 🗓️ Tuesday, Oct 14 🕝 9AM PT Don't miss it! 🔗 Register today: https://lnkd.in/eSRiXpDF #opensource #oss #dataarchitecture #dataengineering

    Diving into Streaming Data Design Patterns for Delta Lake

    Diving into Streaming Data Design Patterns for Delta Lake

    www.linkedin.com

  • View organization page for Delta Lake

    65,161 followers

    Paris! 🇫🇷 The Open Lakehouse + AI Meetup is coming November 24, co-located with the Forward Data Conference. Hear from Youssef Mrini (Databricks) on the next era of open table formats, Alexandre BERGERE (DataGalaxy) on scaling usage analytics through Delta Sharing, & more! Connect with open source builders, explore hands-on architectures, and shape the future of data and AI. 📅 Monday, Nov 24 🕡 6:30–10 PM 📍 Maison Internationale, Paris 🔗 RSVP: https://lu.ma/OLM-1124 #openlakehouse #opensource #lakehouse #ai #deltalake #apachespark Hymaïa Yoann Benoit

    This content isn’t available here

    Access this content and more in the LinkedIn app

  • Exciting news — the Japanese edition of 𝗗𝗲𝗹𝘁𝗮 𝗟𝗮𝗸𝗲: 𝗧𝗵𝗲 𝗗𝗲𝗳𝗶𝗻𝗶𝘁𝗶𝘃𝗲 𝗚𝘂𝗶𝗱𝗲 will be released this November! 🎉 Great work by the team to expand the original guide, sharing new perspectives on why #Lakehouses are the backbone of #AI, and how community collaboration drives data innovation. 🙌 Stay tuned for more! #opensource #deltalake #lakehouse #oss

    View profile for Ryo H.

    Focusing on developing Data+AI products with Design & Engineering Approach on Lakehouse in Business Impact driven motions.

    『詳解 データレイクハウスアーキテクチャ』が2025/11/19に刊行(予定)されます! — Delta Lakeを使ったデータ+AI活用とガバナンス — 詳細はこちら→ https://lnkd.in/gAdAwUPJ 原著『Delta Lake: The Definitive Guide』は、オープンフォーマット「Delta Lake」の公式ガイドとなっていますが、オープンフォーマットの解説に留まらず、日本企業が本当に必要とする「データ+AIを持続的に活かすためのアーキテクチャとガバナンスの詳細」を多く解説しています。 具体的には、本書の前半では、Delta Lakeの基本原理を丁寧に解説し、後半ではデータレイクハウスが「AI時代の基盤」となる理由や、アーキテクチャ、メタデータ管理、Unity Catalogによるデータガバナンスとセキュリティ、AIエージェント時代に求められるデータ基盤の再構築について、アーキテクチャ思考の視点から掘り下げています。 新しい題名『詳解 データレイクハウスアーキテクチャ』には、この本が「単なるDelta Lakeのガイド」ではなく、“データ+AI基盤を正しく設計するための日本版リファレンス”であるという、監訳者チーム( Ryo H. , Satoshi Kuramitsu, Shunichiro TAKESHITA, Shotaro Kotani)の意図が込められています。 AIが生成し、判断し、意思決定を支援する時代において、データレイクハウスは重要なアーキテクチャに関するコンセプトです。Delta Lakeだけでなく、データレイクハウスのアーキテクチャを正しく理解することができる日本で初めての書籍になることを願っています! Special Thanks I’d like to express my deepest gratitude to Michael Armbrust, Matei Zaharia, Denny Lee, Tristen Wentling, Scott Haines, Prashanth Babu and the entire Databricks Delta Lake team for creating Delta Lake: The Definitive Guide — the foundation of this Japanese edition. Your work has not only inspired a generation of data engineers and architects, but also defined how enterprises around the world think about data reliability, openness, and AI readiness. It’s a true honor to bring your vision to Japan and to help more people understand the architecture and governance principles behind the modern Data + AI platform. Thank you for leading the world toward the Lakehouse future. #deltalake #lakehouse #databricks #engorgio

  • View organization page for Delta Lake

    65,161 followers

    Tomorrow, Oct 7 at 9AM PT—last chance to register! ➡️ 𝗙𝘂𝗻𝗰𝘁𝗶𝗼𝗻𝘀 𝘁𝗼 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁𝘀: 𝗥𝗲𝗶𝗺𝗮𝗴𝗶𝗻𝗶𝗻𝗴 𝘁𝗵𝗲 𝗟𝗮𝗸𝗲𝗵𝗼𝘂𝘀𝗲 𝗳𝗼𝗿 𝗮𝗻 𝗔𝗴𝗲𝗻𝘁𝗶𝗰 𝗙𝘂𝘁𝘂𝗿𝗲 🎟️ Register: https://luma.com/OLAI-107 Modern data teams still battle fragmented interfaces and brittle handoffs. What’s missing? Simple, uniform ergonomics: human‑friendly, machine‑usable. Ciro Greco and Jacopo Tagliabue from bauplan will show how function‑based execution, Git‑for‑Data, and full programmability make lakehouses ready for real AI agents—reducing complexity now and enabling safe, autonomous data ops tomorrow. ✅ Agents need more than query-writing—they must safely manage ingestion, testing, and deployment in reproducible environments. Otherwise, it’s just legacy rebranded. Join us and see the path forward! 🤝 #opensource #lakehouse #oss #agents #aiagents

    View organization page for Delta Lake

    65,161 followers

    📣 Mark your calendar for Tuesday, October 7 at 9AM PT for the next Open Lakehouse + AI webinar: “From Functions to AI Agents: Reimagining the Lakehouse for an Agentic Future!” 🚀 Lakehouses promise unified analytics, ML, and governance—but current stacks leave teams juggling fragmented tools and interfaces. What’s needed are simple, uniform environments that are both human-friendly and machine-usable. ✅ The next step is agentic automation: AI agents that manage ingestion, testing, and deployment. To work safely, agents require isolated, deterministic, and reproducible environments—something most legacy stacks lack. Join bauplan founders Ciro Greco and Jacopo Tagliabue to explore how function-based execution, Git-for-Data semantics, and programmable abstractions make lakehouses agent-ready—cutting complexity and freeing teams for real innovation. 👏 🔗 Register here: https://luma.com/OLAI-107 #openlakehouse #opensource #oss #aiagents Lisa N. Cao

    This content isn’t available here

    Access this content and more in the LinkedIn app

  • View organization page for Delta Lake

    65,161 followers

    Delta Connect is a plugin atop Spark Connect, introduced in Spark 3.4, and enables gRPC communication using Protocol Buffers. ✅ This allows client implementations in languages such as Rust and Go to interact with Spark outside the JVM, provided they support the protocol. 🎥 Explore a detailed walk-through and the expanded feature set of Delta 4 in this webinar: https://lnkd.in/eyyud3pP cc Scott Haines (Buf), Youssef Mrini (Databricks) #opensource #oss #deltalake #spark

  • Are you wondering if general concepts like data engineering design patterns can help you learn about #DeltaLake? Or, if it's possible to leverage Delta Lake within your streaming data architecture? In this webinar, Scott Haines and Bartosz Konieczny will answer these two questions. Scott, who gained streaming expertise at Yahoo, Twilio, and Nike, will share with you best practices for leveraging Delta Lake as a component of your streaming architecture. ✅ Bartosz, who recently published Data Engineering Design Patterns, will reverse-engineer a few of these design patterns to explain which Delta Lake features make everything tick. 🗓️ Tuesday, Oct 14 🕝 9AM PT Don't miss it! 🔗 Register today: https://lnkd.in/eSRiXpDF #opensource #oss #dataarchitecture #dataengineering

    Diving into Streaming Data Design Patterns for Delta Lake

    Diving into Streaming Data Design Patterns for Delta Lake

    www.linkedin.com

  • What happens when robust, deterministic lakehouse design meets the power of AI agents? 👉 Reduced tool fragmentation 👉 Safer, reproducible automation 👉 Data teams enabled for true innovation On Tuesday, October 7 at 9AM PT, bauplan founders Ciro Greco and Jacopo Tagliabue will break down how function-based execution and Git-for-Data semantics pave the way for autonomous agentic workflows—cutting complexity and multiplying impact. 🎤 Moderated by Lisa N. Cao 🔗 Register here to reserve your spot: https://luma.com/OLAI-107 #openlakehouse #opensource #oss #aiagents

    View organization page for Delta Lake

    65,161 followers

    📣 Mark your calendar for Tuesday, October 7 at 9AM PT for the next Open Lakehouse + AI webinar: “From Functions to AI Agents: Reimagining the Lakehouse for an Agentic Future!” 🚀 Lakehouses promise unified analytics, ML, and governance—but current stacks leave teams juggling fragmented tools and interfaces. What’s needed are simple, uniform environments that are both human-friendly and machine-usable. ✅ The next step is agentic automation: AI agents that manage ingestion, testing, and deployment. To work safely, agents require isolated, deterministic, and reproducible environments—something most legacy stacks lack. Join bauplan founders Ciro Greco and Jacopo Tagliabue to explore how function-based execution, Git-for-Data semantics, and programmable abstractions make lakehouses agent-ready—cutting complexity and freeing teams for real innovation. 👏 🔗 Register here: https://luma.com/OLAI-107 #openlakehouse #opensource #oss #aiagents Lisa N. Cao

    This content isn’t available here

    Access this content and more in the LinkedIn app

Similar pages

Browse jobs