Showing episodes and shows of Gleb Mezhanskiy

Shows

Data Engineering Podcast
The Future of Data Engineering: AI, LLMs, and Automation
2025-02-26, 59 min
Summary: In this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of Datafold, talks about the intersection of AI and data engineering. He discusses the challenges and opportunities of integrating AI into data engineering, particularly using large language models (LLMs) to enhance productivity and reduce manual toil. The conversation covers the potential of AI to transform data engineering tasks, such as text-to-SQL interfaces and creating semantic graphs to improve data accessibility, and explores practical applications of LLMs in automating code reviews, testing, and understanding data lineage.
Announcements: Hello and welcome to the...

Data Engineering Podcast
Accelerate Migration Of Your Data Warehouse with Datafold's AI Powered Migration Agent
2024-10-28, 48 min
Summary: Gleb Mezhanskiy, CEO and co-founder of Datafold, joins Tobias Macey to discuss the challenges and innovations in data migrations. Gleb shares his experiences building and scaling data platforms at companies like Autodesk and Lyft, and how these experiences inspired the creation of Datafold to address data quality issues across teams. He outlines the complexities of data migrations, including common pitfalls such as technical debt and the importance of achieving parity between old and new systems. Gleb also discusses Datafold's innovative use of AI and large language models (LLMs) to automate translation and reconciliation processes in data migrations, reducing...

Data Engineering Podcast
Reconciling The Data In Your Databases With Datafold
2024-03-17, 58 min
Summary: A significant portion of data workflows involves storing and processing information in database engines. Validating that the information is stored and processed correctly can be complex and time-consuming, especially when the source and destination speak different dialects of SQL. In this episode Gleb Mezhanskiy, founder and CEO of Datafold, discusses the different error conditions and solutions that you need to know about to ensure the accuracy of your data.
Announcements: Hello and welcome to the Data Engineering Podcast, the show about modern data management. Dagster offers a new approach to building and running...

Download High-Quality Full Audiobooks in Non-Fiction, Computers & Technology
97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts by Tobias Macey
2023-11-28, 30 min
Please visit https://thebookvoice.com/podcasts/1/audiobook/705738 to listen to full audiobooks.
Title: 97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts
Author: Tobias Macey
Narrator: Emily Beresford
Format: Unabridged Audiobook
Length: 5 hours 41 minutes
Release date: November 28, 2023
Genres: Computers & Technology
Publisher's Summary: Take advantage of today's sky-high demand for data engineers. With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Contributors from notable companies including Twitter, Google, Stitch Fix, Microsoft, Capital One, and LinkedIn share their experiences and lessons learned for overcoming a variety of specific and often...

Download High-Quality Full Audiobooks in Non-Fiction, Computers & Technology
97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts by Tobias Macey
2023-11-28, 5h 41
Please visit https://thebookvoice.com/podcasts/1/audiobook/705738 to listen to full audiobooks.
Title: 97 Things Every Data Engineer Should Know: Collective Wisdom from the Experts
Author: Tobias Macey
Narrator: Emily Beresford
Format: Unabridged Audiobook
Length: 5 hours 41 minutes
Release date: November 28, 2023
Genres: Computers & Technology
Publisher's Summary: Take advantage of today's sky-high demand for data engineers. With this in-depth book, current and aspiring engineers will learn powerful real-world best practices for managing data big and small. Contributors from notable companies including Twitter, Google, Stitch Fix, Microsoft, Capital One, and LinkedIn share their experiences and lessons learned for overcoming a variety of specific and often nagging...

Data Engineering Podcast
Strategies For A Successful Data Platform Migration
2023-07-31, 1h 09
Summary: All software systems are in a constant state of evolution. This makes it impossible to select a truly future-proof technology stack for your data platform, making an eventual migration inevitable. In this episode Gleb Mezhanskiy and Rob Goretsky share their experiences leading various data platform migrations, and the hard-won lessons that they learned so that you don't have to.
Announcements: Hello and welcome to the Data Engineering Podcast, the show about modern data management. Introducing RudderStack Profiles. RudderStack Profiles takes the SaaS guesswork and SQL grunt work out of building complete customer profiles...

Data Engineering Podcast
Build Better Tests For Your dbt Projects With Datafold And data-diff
2023-06-12, 48 min
Summary: Data engineering is all about building workflows, pipelines, systems, and interfaces to provide stable and reliable data. Your data can be stable and wrong, but then it isn't reliable. Confidence in your data is achieved through constant validation and testing. Datafold has invested a lot of time into integrating with the workflow of dbt projects to add early verification that the changes you are making are correct. In this episode Gleb Mezhanskiy shares some valuable advice and insights into how you can build reliable and well-tested data assets with dbt and data-diff.
Announcements: ...

The Data Stack Show
108: You Can’t Separate Data Reliability From Workflow with Gleb Mezhanskiy of Datafold
2022-10-12, 1h 00
Highlights from this week’s conversation include:
Gleb’s background and career journey (2:51)
The adoption problems (10:53)
How Datafold solves these problems (18:08)
The vision for Datafold (26:27)
Incorporating Datafold as a data engineer (38:53)
The importance of the data engineer (42:12)
Something to keep in mind when designing data tools (46:46)
Implementing new technology into your company (53:18)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcome...

The Data Stack Show
The PRQL: Are Marketers the Worst Data Quality Offenders?
2022-10-07, 04 min
In this bonus episode, Eric and Kostas preview their upcoming conversation with Gleb Mezhanskiy of Datafold.

Monday Morning Data Chat
Improving the Modern Data Stack w/ Gleb Mezhanskiy (CEO @ Datafold) - Special Episode
2022-07-22, 53 min
Gleb Mezhanskiy joins the show to chat about ways to improve the Modern Data Stack.
Datafold: https://www.datafold.com/

Data Engineering Podcast
Be Confident In Your Data Integration By Quickly Validating Matching Records With data-diff
2022-07-03, 1h 10
Summary: The perennial challenge of data engineers is ensuring that information is integrated reliably. While it is straightforward to know whether a synchronization process succeeded, it is not always clear whether every record was copied correctly. In order to quickly identify if and how two data systems are out of sync, Gleb Mezhanskiy and Simon Eskildsen partnered to create the open source data-diff utility. In this episode they explain how the utility is implemented to run quickly and how you can start using it in your own data workflows to ensure that your data warehouse isn’t mi...

Data Engineering Podcast
A Reflection On The Data Ecosystem For The Year 2021
2022-01-02, 1h 03
Summary: This has been an active year for the data ecosystem, with a number of new product categories and substantial growth in existing areas. In an attempt to capture the zeitgeist, Maura Church, David Wallace, Benn Stancil, and Gleb Mezhanskiy join the show to reflect on the past year and share their thoughts on the year to come.
Announcements: Hello and welcome to the Data Engineering Podcast, the show about modern data management. When you’re ready to build your next pipeline, or want to test out the projects you hear about on the sh...

Building the Backend: Data Solutions that Power Leading Organizations
How To Effectively Reduce Data Quality Incidents 10x with Datafold
2021-09-21, 39 min
This episode features Gleb Mezhanskiy, Co-Founder & CEO @ Datafold. During our discussion we talk all about data observability and how to improve your data quality. Before Datafold, Gleb was a founding member of data teams at Lyft and Autodesk, where he built sophisticated data platforms and developed tooling to improve productivity and data quality.
Top 3 Value Bombs:
The foundation of any data observability platform is the data catalog.
Data observability becomes increasingly difficult the more data sets you have if you do not define your process to track and monitor your data.
Do not surprise yo...

Datacast
Episode 72: Folding Data with Gleb Mezhanskiy
2021-09-18, 1h 07
Timestamps:
(01:42) Gleb shared briefly about his upbringing and studying Economics in university in Russia.
(04:15) Gleb discussed his move to the US to pursue a Master of Information Systems Management at Carnegie Mellon University.
(07:07) Gleb went over his summer internship as a Business Analyst at Autodesk.
(08:40) Gleb shared the details of his project architecting data model/ETL pipelines as a PM at Autodesk.
(11:34) Gleb unpacked the evolution of his career at Lyft, from an individual data analyst to a PM on data tooling and a high-impact project that he worked on.
(16:54) Gleb shared valuable lessons from the experience of leadin...

Data Engineering Podcast
Strategies For Proactive Data Quality Management
2021-07-20, 1h 01
Summary: Data quality is a concern that has been gaining attention alongside the rising importance of analytics for business success. Many solutions rely on hand-coded rules for catching known bugs, or statistical analysis of records to detect anomalies retroactively. While those are useful tools, it is far better to prevent data errors before they become an outsized issue. In this episode Gleb Mezhanskiy shares some strategies for adding quality checks at every stage of your development and deployment workflow to identify and fix problematic changes to your data before they get to production.
Announcements: ...

Software Engineering Daily
Datafold: Data Quality Tooling with Gleb Mezhanskiy
2020-10-26, 44 min
Effective data science requires clean data. As data moves through the data pipeline, there may be errors introduced. Errors can also arise from code changes, database migrations, and other forms of data movement. How can you ensure data quality within a fast moving, dynamic data system? Datafold is a company built around data quality management. It allows users to compare tables and databases, as well as automate data QA. Gleb Mezhanskiy is a founder of Datafold and joins the show to talk about the data quality space and what he is building with Datafold.
Sponsorship inquiries: sponsor@s...