Listen

Description

Telemetry has the potential to guide the future of Airflow, but only if it’s implemented transparently and with community trust. 


In this episode, we’re joined by Bolke de Bruin, Director at Metyis and a long-time Airflow PMC member. Bolke discusses how telemetry has been handled in the past, why it matters now and what it will take to get it right.


Key Takeaways:


(03:20) The role of foundations in establishing credibility and sustainability.

(04:52) Why data collection is critical to open-source project direction.

(07:24) Lessons learned from previous approaches to user data collection.

(10:23) The current state of telemetry in the project.

(10:53) Community trust as a prerequisite for technical implementation.

(12:54) The importance of managing sensitive data within trusted ecosystems.

(16:37) Ethical considerations in balancing participation and access.

(18:45) Forward-looking ideas for improving workflow design and usability.


Resources Mentioned:


Bolke de Bruin

https://www.linkedin.com/in/bolke/


Metyis | LinkedIn

https://www.linkedin.com/company/metyis/


Metyis | Website

http://www.metyis.com


Apache Airflow

https://airflow.apache.org/


Airflow Summit

https://airflowsummit.org/


Airflow Dev List

https://lists.apache.org/list.html?dev@airflow.apache.org


https://www.astronomer.io/events/roadshow/london/

   

https://www.astronomer.io/events/roadshow/new-york/ 

  

https://www.astronomer.io/events/roadshow/sydney/   


https://www.astronomer.io/events/roadshow/san-francisco/  

 

https://www.astronomer.io/events/roadshow/chicago/ 




Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.



#AI #Automation #Airflow #MachineLearning