Just Enough Data Weekly Newsletter 10
Data & AI Monetization
Generating Synthetic Data With GANs
Do we have enough data? Are our datasets imbalanced? How can we accelerate research while avoiding data leakage? Generative Adversarial Networks (GANs) are a promising AI solution to these questions. With GANs, we can generate more and better data that is more fair and generalizable, which can be used to improve ML models and algorithm testing. GANs are also promising in the field of data privacy, since they could break barriers to data sharing, allowing companies and institutions to accelerate research findings.
Decision Making at Netflix
This introduction is the first in a multi-part series on how Netflix uses A/B tests to make decisions that continuously improve our products, so we can deliver more joy and satisfaction to our members. Subsequent posts will cover the basic statistical concepts underpinning A/B tests, the role of experimentation across Netflix, how Netflix has invested in infrastructure to support and scale experimentation, and the importance of the culture of experimentation within Netflix.
Accelerating Early SaaS Growth While Building a Sustainable Business
6 Surprising AWS Charges You Should Monitor Closely
Every company should actively monitor its cloud costs, which can drive up the overall cloud bill. However, if you handle them properly, this can be avoided. Here are some AWS charges you should keep an eye on.
Accelerating Into the U.S. Grand Prix With McLaren
If you thought this week couldn’t get more thrilling, I’ve got news for you: We’re shifting into overdrive. On the heels of our action-packed customer and user conference, .conf21, we’re heading to Austin, Texas this weekend to cheer on our data-driven partners at McLaren Racing for the Formula 1 U.S. Grand Prix.
The Gartner Magic Quadrant for Metadata Management was just scrapped. Here’s everything you need to know
Metadata management started out as an IT discipline. As we embraced the internet, and as data types and formats exploded, IT teams were put in charge of creating an “inventory of data.”
Case Study: eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data
From business communications and financial transactions to trip planning and activity tracking, much of our lives run through smartphones today. eGoGames will help you add competitive esports to that list.
The Future of the Data Engineer
One of the first data engineers at Facebook and Airbnb, he wrote and open sourced the wildly popular orchestrator, Apache Airflow, followed shortly thereafter by Apache Superset, a data exploration tool that’s taking the data viz landscape by storm. Currently, Maxime is CEO and co-founder of Preset, a fast-growing startup that’s paving the way forward for AI-enabled data visualization for modern companies.
Monitors as Code: A New Way to Deploy Custom Data Quality Monitors From Your CI/CD Workflow
Monte Carlo releases Monitors as Code, a new feature that allows data engineers to easily configure new data quality monitors as part of their daily workflow.
When you are in tough situation, just take a step back and relax. Give yourself a time, you will figure out a way :)