podcast
details
.com
Print
Share
Look for any podcast host, guest or anyone
Search
Showing episodes and shows of
Gleb Mezhanskiy
Shows
Data Engineering Podcast
The Future of Data Engineering: AI, LLMs, and Automation
SummaryIn this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of DataFold, talks about the intersection of AI and data engineering. He discusses the challenges and opportunities of integrating AI into data engineering, particularly using large language models (LLMs) to enhance productivity and reduce manual toil. The conversation covers the potential of AI to transform data engineering tasks, such as text-to-SQL interfaces and creating semantic graphs to improve data accessibility, and explores practical applications of LLMs in automating code reviews, testing, and understanding data lineage.AnnouncementsHello and welcome to the...
2025-02-26
59 min
Data Engineering Podcast
Accelerate Migration Of Your Data Warehouse with Datafold's AI Powered Migration Agent
SummaryGleb Mezhanskiy, CEO and co-founder of DataFold, joins Tobias Macey to discuss the challenges and innovations in data migrations. Gleb shares his experiences building and scaling data platforms at companies like Autodesk and Lyft, and how these experiences inspired the creation of DataFold to address data quality issues across teams. He outlines the complexities of data migrations, including common pitfalls such as technical debt and the importance of achieving parity between old and new systems. Gleb also discusses DataFold's innovative use of AI and large language models (LLMs) to automate translation and reconciliation processes in data migrations, reducing...
2024-10-28
48 min
Data Engineering Podcast
Reconciling The Data In Your Databases With Datafold
Summary A significant portion of data workflows involve storing and processing information in database engines. Validating that the information is stored and processed correctly can be complex and time-consuming, especially when the source and destination speak different dialects of SQL. In this episode Gleb Mezhanskiy, founder and CEO of Datafold, discusses the different error conditions and solutions that you need to know about to ensure the accuracy of your data. Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management Dagster offers a new approach to building and running...
2024-03-17
58 min
Download High-Quality Full Audiobooks in Non-Fiction, Computers & Technology
97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts by Tobias Macey
Please visit https://thebookvoice.com/podcasts/1/audiobook/705738 to listen full audiobooks. Title: 97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts Author: Tobias Macey Narrator: Emily Beresford Format: Unabridged Audiobook Length: 5 hours 41 minutes Release date: November 28, 2023 Genres: Computers & Technology Publisher's Summary: Take advantage of today's sky-high demand for data engineers. With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Contributors from notable companies including Twitter, Google, Stitch Fix, Microsoft, Capital One, and LinkedIn share their experiences and lessons learned for overcoming a variety of specific and often...
2023-11-28
30 min
Download High-Quality Full Audiobooks in Non-Fiction, Computers & Technology
97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts by Tobias Macey
Please visithttps://thebookvoice.com/podcasts/1/audiobook/705738to listen full audiobooks. Title: 97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts Author: Tobias Macey Narrator: Emily Beresford Format: Unabridged Audiobook Length: 5 hours 41 minutes Release date: November 28, 2023 Genres: Computers & Technology Publisher's Summary: Take advantage of today's sky-high demand for data engineers. With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Contributors from notable companies including Twitter, Google, Stitch Fix, Microsoft, Capital One, and LinkedIn share their experiences and lessons learned for overcoming a variety of specific and often nagging...
2023-11-28
5h 41
Data Engineering Podcast
Strategies For A Successful Data Platform Migration
Summary All software systems are in a constant state of evolution. This makes it impossible to select a truly future-proof technology stack for your data platform, making an eventual migration inevitable. In this episode Gleb Mezhanskiy and Rob Goretsky share their experiences leading various data platform migrations, and the hard-won lessons that they learned so that you don't have to. Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management Introducing RudderStack Profiles. RudderStack Profiles takes the SaaS guesswork and SQL grunt work out of building complete customer profiles...
2023-07-31
1h 09
Data Engineering Podcast
Build Better Tests For Your dbt Projects With Datafold And data-diff
Summary Data engineering is all about building workflows, pipelines, systems, and interfaces to provide stable and reliable data. Your data can be stable and wrong, but then it isn't reliable. Confidence in your data is achieved through constant validation and testing. Datafold has invested a lot of time into integrating with the workflow of dbt projects to add early verification that the changes you are making are correct. In this episode Gleb Mezhanskiy shares some valuable advice and insights into how you can build reliable and well-tested data assets with dbt and data-diff. Announcements ...
2023-06-12
48 min
The Data Stack Show
108: You Can’t Separate Data Reliability From Workflow with Gleb Mezhanskiy of Datafold
Highlights from this week’s conversation include:Gleb’s background and career journey (2:51)The adoption problems (10:53)How Datafold solves these problems (18:08)The vision for Datafold (26:27)Incorporating Datafold as a data engineer (38:53)The importance of the data engineer (42:12)Something to keep in mind when designing data tools (46:46)Implementing new technology into your company (53:18) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcome...
2022-10-12
1h 00
The Data Stack Show
The PRQL: Are Marketers the Worst Data Quality Offenders?
In this bonus episode, Eric and Kostas preview their upcoming conversation with Gleb Mezhanskiy of Datafold.
2022-10-07
04 min
Monday Morning Data Chat
Improving the Modern Data Stack w/ Gleb Mezhanskiy (CEO @ Datafold) - Special Episode
Gleb Mezhanskiy joins the show to chat about ways to improve the Modern Data Stack. Datafold: https://www.datafold.com/
2022-07-22
53 min
Data Engineering Podcast
Be Confident In Your Data Integration By Quickly Validating Matching Records With data-diff
Summary The perennial challenge of data engineers is ensuring that information is integrated reliably. While it is straightforward to know whether a synchronization process succeeded, it is not always clear whether every record was copied correctly. In order to quickly identify if and how two data systems are out of sync Gleb Mezhanskiy and Simon Eskildsen partnered to create the open source data-diff utility. In this episode they explain how the utility is implemented to run quickly and how you can start using it in your own data workflows to ensure that your data warehouse isn’t mi...
2022-07-03
1h 10
Data Engineering Podcast
A Reflection On The Data Ecosystem For The Year 2021
Summary This has been an active year for the data ecosystem, with a number of new product categories and substantial growth in existing areas. In an attempt to capture the zeitgeist Maura Church, David Wallace, Benn Stancil, and Gleb Mezhanskiy join the show to reflect on the past year and share their thought son the year to come. Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the sh...
2022-01-02
1h 03
Building the Backend: Data Solutions that Power Leading Organizations
How To Effectively Reduce Data Quality Incidents 10x with Datafold
This episode features Gleb Mezhanskiy Co-Founder & CEO @ Datafold, during our discussion we talk all about data observability and how to improve your data quality. Before Datafold, Gleb was a founding member of data teams at Lyft and Autodesk, where he built sophisticated data platforms and developed tooling to improve productivity and data quality.Top 3 Value Bombs:The foundation of any data observability platform is the data catalog. Data observability becomes increasingly difficult the more data sets you have if you do not define your process to track and monitor your data. Do not surprise yo...
2021-09-21
39 min
Datacast
Episode 72: Folding Data with Gleb Mezhanskiy
Timestamps(01:42) Gleb shared briefly about his upbringing and studying Economics in university in Russia.(04:15) Gleb discussed his move to the US to pursue a Master of Information Systems Management at Carnegie Mellon University.(07:07) Gleb went over his summer internship as a Business Analyst at Autodesk.(08:40) Gleb shared the details of his project architecting data model/ETL pipelines as a PM at Autodesk.(11:34) Gleb unpacked the evolution of his career at Lyft — from an individual data analyst to a PM on data tooling and a high-impact project that he worked on.(16:54) Gleb shared valuable lessons from the experience of leadin...
2021-09-18
1h 07
Data Engineering Podcast
Strategies For Proactive Data Quality Management
Summary Data quality is a concern that has been gaining attention alongside the rising importance of analytics for business success. Many solutions rely on hand-coded rules for catching known bugs, or statistical analysis of records to detect anomalies retroactively. While those are useful tools, it is far better to prevent data errors before they become an outsized issue. In this episode Gleb Mezhanskiy shares some strategies for adding quality checks at every stage of your development and deployment workflow to identify and fix problematic changes to your data before they get to production. Announcements ...
2021-07-20
1h 01
Software Engineering Daily
Datafold: Data Quality Tooling with Gleb Mezhanskiy
Effective data science requires clean data. As data moves through the data pipeline, there may be errors introduced. Errors can also arise from code changes, database migrations, and other forms of data movement. How can you ensure data quality within a fast moving, dynamic data system? Datafold is a company built around data quality management. It allows users to compare tables and databases, as well as automate data QA. Gleb Mezhanskiy is a founder of Datafold and joins the show to talk about the data quality space and what he is building with Datafold.Sponsorship inquiries: sponsor@s...
2020-10-26
44 min