Just Enough Data Weekly Newsletter 7
From Future Data
Boss Talk: Future Data Edition
In this Future Data edition of their engaging Clubhouse series, a16z Co-Founder Ben Horowitz and Databricks Co-founder and CEO Ali Ghodsi join Sisu Founder and CEO Peter Bailis for a discussion on leadership, management, and more.
Databricks: The data lakehouse comes to you
Data + AI Summit is the event for the data community and Data + AI World Tour brings the energy of Summit to a whole new audience. With content, customers and speakers tailored to each region, the World Tour showcases why lakehouse is quickly becoming the global standard for data architecture.
5 Best Practices to Completely Eliminate Costly Data Copies With a SQL Lakehouse Platform
Your data teams and business users depend on mission-critical dashboards, BI tools and ad hoc queries to gain insight into your organizational data. You should be able to make data readily accessible directly from your cloud data lake storage with the speed and performance your business users are looking for, without creating optimized data copies. This whitepaper shares the 5 best practices that will help you completely eliminate expensive data copies with a SQL lakehouse platform.
Apache NiFi vs. Apache Airflow
Apache Airflow and Apache NiFi are, in fact, two whistles to a somewhat different tune. Still, you may be wondering which one is better suited for your expectations and goals. By the end of this article, you will no longer have doubts.
Github Backend Data
Ever wondered where does the data get stored when you commit to Github.
Here is the coolest post:Partitioning GitHub’s relational databases to handle scale
Lineage Metadata Where You Need It - Datafold’s GraphQL API
Column-level lineage lets you see where your data is coming from and going to in seconds. Zoom in to trace the flow and truly understand where the values in a column are coming from without digging through SQL code or spending hours hunting through PRs. This can be particularly helpful when you need to understand the downstream impact of a change in your pipeline or track where PII data is being used.
Data Governance 101: What it is, and Why it’s Critical to Your Business
In an increasingly data-driven world, organizations use data to remain competitive, find new opportunities, and provide better service to their customers. While using this data, organizations must comply with laws and regulations applicable to data privacy, storage, and processing and extract valuable insights.
No matter how proficient you are in your field. Having the basics clear is the main key.