Showing episodes and shows of
Data On Kubernetes Community
Shows
Data on Kubernetes Community
Repel Boarders! How to find a Kubernetes operator that really protects your data | DoKC Town Hall
Presented by Robert Hodges, Altinity. Operators are a godsend for managing data in Kubernetes. But how about protecting it? We'll explore security threats to cloud native databases and show what protection you should look for in operators. Finally we'll introduce a new Data on Kubernetes Community project to develop security standards for database operators in Kubernetes. Related Links DoKC Website - https://dok.community/ DoKC Meetups - https://www.meetup.com/data-on-kubernetes-community/ Join Slack - https://join.slack.com/t/dokcommunity/shared_invite/zt-1...
2024-01-30
19 min
Data on Kubernetes Community
Formula 1 telemetry processing using Apache Kafka on Kubernetes // Paolo Patierno (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) Video - https://youtu.be/4cPVRWOK-_E ABSTRACT Apache Kafka is the de facto data streaming platform used for ingesting vast amounts of data and processing them in real-time. Low latency analytics are vital if users are to react to events as fast as possible and to effectively shape future decision making. Together with Kubernetes, it allows developing cloud-oriented analytics solutions that are highly scalable. During this session, we will explore an Apache Kafka based solution running o...
2022-11-02
15 min
Data on Kubernetes Community
Choosing Kubernetes for Stateful Applications // Akshay Ram & Peter Schuurman (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) Video - https://youtu.be/Y4tdy9lctEI ABSTRACT Learn how customers are increasingly deploying stateful applications on Kubernetes to benefit from portability, economies of scale, and built-in orchestration capabilities. This talk will include how customers choose between using Kubernetes or a data Software as a Service (SaaS), and the stateful capabilities of Kubernetes across two dimensions - the application orchestration and the storage layer. Also learn about MariaDB SKYSQL, a database software as a service that runs thousands of StatefulSet Pods across multiple z...
2022-11-02
18 min
Data on Kubernetes Community
Kubernetes 360º - Data driven observability - from Secrets to logs // Ben Hirschberg (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) Video - https://youtu.be/A1ch4AhKoeQ ABSTRACT If there’s one thing that everyone can agree on - it’s that the sheer scale and complexity of Kubernetes operations is growing constantly. What’s more, cloud native environments are becoming more and more expensive to operate and manage, as well as increasingly difficult to secure. On the bright side, there is a growing ecosystem of exceptional open source tools to help overcome this complexity, and provide greater situational awareness to wha...
2022-11-02
17 min
Data on Kubernetes Community
Shifting Left Stateful Applications In Kubernetes // Viktor Farcic (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) Video - https://youtu.be/LymPjH6HA3E ABSTRACT Stateless apps are easy to manage. More often than not, a Kubernetes Deployment, with a Service, Ingress, and Horizontal Pod Autoscaler (HPA) is enough. Almost everyone can do it. But, when it comes to stateful applications, things become a bit more complicated. We might need a database and storage. We might need to manage database users and schema. We might need to consider quite a few other things. Stateful apps are harder for everyone, especially...
2022-11-02
15 min
Data on Kubernetes Community
Medical - Healthcare Data on Kubernetes // Olyvia Rakshit & Prasad Dorbala (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT Healthcare organizations are transforming their applications and embracing digital platforms for efficient patient care. Today, compute at the edge plays a critical role in deploying innovative healthcare applications that promise new approaches to patient care. Connected care platforms at scale are turning to Kubernetes infrastructure for deployment of telehealth and telemedicine apps. These kinds of applications often need real time image processing and inferencing at the edge, which in turn requires a distributed application platform on Kubernetes. They also need secure foundations for a re...
2022-11-02
13 min
Data on Kubernetes Community
Highly Available Postgres Clusters In Kubernetes // John Long & Jonathan Gonzalez (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT A practical session about running Highly Available PostgreSQL in Kubernetes. The primary objective will be to demonstrate how to set up a reliable architecture in a Kubernetes cluster to achieve low RTO and RPO. This will be covered by going over the various Kubernetes native components of our fully open source operator CloudNativePG. It will be demonstrated how these components allow for a simple adoption of PostgreSQL into Kubernetes environments. If time and technical constraints permit there will be a live demo at the en...
2022-11-02
15 min
Data on Kubernetes Community
Inter-Cluster PostgreSQL on Kubernetes // Julian Fischer (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT In this talk you’ll explore how to run a PostgreSQL cluster across multiple Kubernetes clusters. Learn what challenges arise when using asynchronous streaming replication in a set of Kubernetes clusters spanning several geographical regions. The talk discusses how such a topology can be established using Kubernetes and Kubernetes Add-Ons. You will learn which qualities the system has, where such a distributed system plays to its strengths and where it may be problematic to use it. At the end of the talk you wil...
2022-11-02
17 min
Data on Kubernetes Community
Open Source Databases on Kubernetes - Best Practices // Peter Zaitsev (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT So you’re looking to run your Open Source Database on Kubernetes. What best practices should you follow and what pitfalls should you avoid? In this presentation we will look at how to run stateful applications on Kubernetes overall as well as what is particularly important for databases - we will cover high availability, security, backups and disaster recovery. Finally we will show how these practices can be implemented with Percona Operators for MySQL, MongoDB, PostgreSQL - one of the leading solutions to run Open...
2022-11-02
16 min
Data on Kubernetes Community
The Kubernetes Native Database // Jeffrey Carpenter (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT In the software industry we’re fond of terms that define major trends, like “cloud native”, “Kubernetes native” and “serverless”. As more and more organizations move stateful workloads to Kubernetes, we’ve started to see these terms applied to data infrastructure, where they can get overtaken by marketing hype unless we work to define them. In this talk, we’ll examine two different databases, TiDB and Apache Cassandra, in order to identify what it means for a database to be Kubernetes native and why it matters...
2022-11-02
16 min
Data on Kubernetes Community
Databases on Kubernetes: Why are they important? // With Bhavin Shah, Xing Yang, Gabriele Bartolini & Patrick McFadin (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT Kubernetes has crossed the chasm, but what about stateful applications and databases? Join us for this panel discussion and learn more about how organizations are deploying different databases like PostgreSQL and Cassandra on Kubernetes, what are the benefits of running databases on Kubernetes, and how the ecosystem is working towards making these things boring, so you can focus on your applications! We will have an interactive discussion with the hosts and guests of the Kubernetes Bytes podcast, and open it up to the audience...
2022-11-02
34 min
Data on Kubernetes Community
The Challenges of Data Processing On Kubernetes - A look at Spark, Flink, Dask, and Ray // Holden Karau (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT This talk will go through both the improvements that have been made in Kubernetes for batch analytic workloads as well as some of the current pain experienced by users and developers moving their workloads to Kube. In this talk you will learn about how we “cheated” back in the YARN and Mesos days to make things go fast, why Kubernetes doesn’t like those cheats, and what some alternatives are.
2022-10-31
20 min
Data on Kubernetes Community
Why we decided to migrate our Jaeger storage to ClickHouse on Kubernetes // Arul Jegadish Francis (DoK Day North America 2022)
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT We at OpsVerse provide a DevOps tools platform with fully-managed open source-based tools. One of our key offerings is a holistic observability platform. Metrics and logs are straightforward to aggregate; however, traces – which are collected using CNCF Jaeger – were left with some holes in advanced insights. In this talk, I will explain why we decided to switch our Jaeger storage backend to ClickHouse, and why we decided to run it on Kubernetes. I will also share tips and tricks to migrate and run Cli...
2022-10-28
13 min
Data on Kubernetes Community
DoK #152 - Running PostgreSQL in Kubernetes: from day 0 to day 2 with CloudNativePG // Gabriele Bartolini
https://go.dok.community/slack https://dok.community With: Gabriele Bartolini - Vice President/CTO of Cloud Native and Kubernetes, EDB Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK Imagine this: you have a virtual infrastructure based on Kubernetes, made up of virtual data centers, possibly spread across multiple Kubernetes clusters and regions. Your infrastructure could even be hosted on premises or on different cloud service providers. Infrastructure as Code is a requirement. You’ve been tasked to run Postgres databases, alongside your applications....
2022-09-28
1h 03
Data on Kubernetes Community
Dok Talks #148 - Cost and Kubernetes // Chris Love
https://go.dok.community/slack https://dok.community With: Chris Love - Managing Partner, LionKube Bart Farrell - Head of Community, Data on Kubernetes Community ABSTRACT OF THE TALK Using Kubernetes to run data workloads costs less than running the same workloads on separate servers. But how do we save at least twenty to thirty percent more? We need the data and a cost-savings culture. First, we will explore a project called OpenCost: cost monitoring for cloud-native environments. Secondly, we will explore why company culture is more important than cost...
2022-09-27
45 min
Data on Kubernetes Community
DoK Talks #142 - Kubernetes Cluster Upgrade Strategies and Data: Best Practices for your Stateful Workload // Peter Schuurman
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK How do you make sure your Stateful Workloads remain available when your Kubernetes infrastructure updates? This talk will discuss different strategies of upgrading a Kubernetes cluster, and how you can manage risk for your workload. The talk will showcase demos of each upgrade strategy. BIO Peter is a Senior Software Engineer on GKE at Google. He works on improving Kubernetes for Stateful workloads. His main focus is on enhancing the Kubernetes ecosystem for high availability applications. KE...
2022-08-18
58 min
Data on Kubernetes Community
DoK Talks #144 - Mastering MongoDB on Kubernetes, the power of operators // Arek Borucki
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK During my first talk for the DoK community I want to walk you through the world of the NoSQL database MongoDB and its Kubernetes Operators - Community Edition, Enterprise Edition (MongoDB and Ops Manager on K8s), and the Atlas operator. I will highlight the most important capabilities and talk about use cases and challenges. The theory will be mixed with live demos! BIO I'm an SRE / NoSQL / DevOps professional. I hold CKA, CKAD and CKS certifications, and I’m also a MongoDB Certified DBA and MongoDB Champion. I have exper...
2022-07-26
1h 00
Data on Kubernetes Community
DoK Talks #135 - DoK isn't just Database on Kubernetes // Patrick McFadin
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK What about your streaming and analytic workloads? If you are all-in on Kubernetes you can't forget about these important parts of your infrastructure. I'll talk about the current state of the art, why organizations may hesitate to go beyond deploying databases in Kubernetes and, most importantly, some key things you need to be successful. BIO Patrick McFadin is the co-author of the upcoming O’Reilly book “Managing Cloud-Native Data on Kubernetes”. He currently works at DataStax in Developer Relati...
2022-06-10
46 min
Data on Kubernetes Community
DoK Talks #132 - Time-series on SQL Server on Kubernetes on ARM64… without SQL Server! // Álvaro Hernández
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Oh wow. What a weird title. Full of terms that don’t fit together. Or do they? This talk is for believers, those who believe that the author will deliver what the title says. Here’s the promise of what you will see: * Time-series specialized storage and functions on SQL Server. * SQL Server running on Kubernetes. * Using ARM64 (aarch64) nodes. * In reality, without using SQL Server, but rather Postgres Babelfish! The talk will b...
2022-06-02
1h 05
Data on Kubernetes Community
Why run Postgres in Kubernetes (DoK Day EU 2022) // Gabriele Bartolini
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Postgres should run inside your Kubernetes cluster. Yes, inside, not outside Kubernetes. After all, a database should be seen as an application, a special type of application - for which it is legitimate to require an additional level of care and attention. However, the small price you pay for this is worth the return on investment that your organization receives by running microservice applications, plus database combos, inside your Cloud Native infrastructure.
2022-05-28
10 min
Data on Kubernetes Community
What's New in Kubernetes Storage (DoK Day EU 2022) // Xing Yang
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Kubernetes SIG Storage is responsible for ensuring storage is available for containers in a pod when the pod is scheduled on a node. There is the Container Storage Interface (CSI) for block and file storage that allows storage providers to write CSI drivers. There is also a COSI sub-project that is trying to add object storage support in Kubernetes. In this session, Xing will give an update on some of the features that SIG S...
2022-05-28
09 min
Data on Kubernetes Community
What we've learned from running a PostgreSQL managed service on Kubernetes (DoK Day EU 2022) // Oleksii Kliukin
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Kubernetes is an emerging platform of choice for deploying and running PostgreSQL. Deploying 100 Postgres clusters is as easy as deploying one, and there is no need to tinker with tools like Ansible or Puppet. Resource sharing can be applied when it makes sense, making it possible to run multiple Postgres databases in isolation on a single instance, each storing the data on a dedicated persistent volume. There are great open-source tools out there to deal with h...
2022-05-28
11 min
Data on Kubernetes Community
Using Kubernetes to deliver a “serverless” service (DoK Day EU 2022) // Jim Walker
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Serverless promises to change the way we consume software. It allows us to potentially pay for only that which we use and can help drive down operational costs to the minimal amount of resources necessary. Architecting for serverless requires a unique look at app logic and the way it is deployed. It takes a combination of the logical and physical worlds. An architectural pattern has emerged where we can scale ephemeral compute s...
2022-05-28
20 min
Data on Kubernetes Community
The many uses of Kubernetes cross cluster migration of persistent data (DoK Day EU 2022) // Ryan Kaw
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Multiple clusters exist in most Kubernetes environments today, and the number of clusters will increase over time. There are many reasons for having multiple Kubernetes clusters: for example, overcoming scale limits, reducing complexity, geo separation, redundancy and having separate production, staging, and development environments. Once you have multiple K8S clusters, it can be useful to have the ability to easily move or duplicate workloads across these different clusters. Kubernetes does not have a native method t...
2022-05-28
07 min
Data on Kubernetes Community
The future of data on Kubernetes with Adobe and CNCF (DoK Day EU 2022) // Joseph Sandoval, Xing Yang & Sylvain Kalache
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Some data-intensive workloads are easier to run in Kubernetes than others. Why? What needs to improve? Join us as we deep dive with Adobe and the CNCF about how easy (or not) it is to run different types of data workloads on Kubernetes – and what is being done both inside and outside of Kubernetes to make data workloads easier. Xing Yang is a Tech Lead in the Cloud Native Storage team at VMw...
2022-05-28
17 min
Data on Kubernetes Community
The Data on Kubernetes Landscape (DoK Day EU 2022) // Melissa Logan & Sylvain Kalache
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) We know from the first Data on Kubernetes Report that 90% of respondents believe Kubernetes is ready for stateful workloads, but significant challenges remain. The DoK Community continues to grow and build a unique space where people share knowledge and have conversations that are shaping the next decade of data on Kubernetes. Melissa Logan and Sylvain Kalache will discuss the growth of DoK, DoKC, and introduce a new project to help end users o...
2022-05-27
10 min
Data on Kubernetes Community
Running Kafka on Kubernetes, across three clouds at Adobe (DoK Day EU 2022) // Adi Muraru
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Adobe runs dozens of Kafka clusters spread across both public (AWS and Azure) and private clouds to power the Adobe Experience Platform message bus. In this talk, we share the experience of running Kafka as a stateful Kubernetes workload in a cloud-native environment using the open-source Koperator and the tools we built to automate the whole lifecycle. While running a stateful service like Kafka on Kubernetes may be intimidating at t...
2022-05-27
16 min
Data on Kubernetes Community
Running a database on local NVMes on Kubernetes (DoK Day EU 2022) // Tomáš Nožička & Maciej Zimnoch
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Running a database on Kubernetes with persistent storage is relatively easy but when it comes to performance it won’t match local NVMes. This talk will show you how to set up the local NVMes for Kubernetes, how to handle the application and cluster lifecycle in a safe manner and share our experience with running ScyllaDB with local NVMes on different Kubernetes cloud providers. Tomas leads the development of Scylla Operator (https://git...
2022-05-27
09 min
Data on Kubernetes Community
Protecting data with CSI Volume Snapshots on Kubernetes (DoK Day EU 2022) // Grant Griffiths
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) The container storage interface (CSI) is a contract between different container orchestrators (Kubernetes, Nomad, etc) and storage plugins. This contract is a set of gRPC services for provisioning, utilizing, and snapshotting storage volumes. In this talk, we will focus on one aspect of the CSI spec: Volume Snapshots. The talk will cover volume snapshots on Kubernetes CSI, with a deep dive on Kubernetes CSI snapshots. A short demo on the basics of t...
2022-05-27
11 min
Data on Kubernetes Community
Operating FoundationDB on Kubernetes (DoK Day EU 2022) // Johannes M. Scheuermann
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) FoundationDB is an open-source distributed transactional Key-Value store that is used by multiple companies like Apple, Snowflake and VMware Tanzu (previously Wavefront). This talk will cover the design of the FoundationDB operator and lessons learned from operating FoundationDB on Kubernetes. We will discuss some of the missing pieces in Kubernetes to make it easier to operate FoundationDB on top of it and how we solved those challenges in the operator. We will f...
2022-05-27
08 min
Data on Kubernetes Community
One Click to Run Apache Spark as a Service on Kubernetes (DoK Day EU 2022) // Bo Yang
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) It is still challenging to run Apache Spark and other big data processing workloads on Kubernetes, especially at large scale. People need to address various issues like resource isolation, queuing, and cost efficiency. This session will share details about those challenges and how to address them. We will also present a convenient (one-click) way to deploy Apache Spark on Kubernetes, and dramatically lower the barrier to using Spark. Bo has been working i...
2022-05-27
09 min
Data on Kubernetes Community
Microservices and Kubernetes for your Full Data Lifecycle (DoK Day EU 2022) // Steve Pousty
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Data doesn’t magically appear in our data centers. There are usually several phases and several storage locations along its journey throughout your organization. New architectural patterns, such as microservices, and new technology, such as Kubernetes are changing how we can think about and manage the large volumes of data coming at us. In this talk we will begin by quickly introducing the architecture and technology and how they make our lives bet...
2022-05-27
14 min
Data on Kubernetes Community
Leveraging Running Stateful Workloads on Kubernetes for the Benefit of Developers (DoK Day EU 2022) // Arsh Sharma, Lapo Elisacci & Ramiro Berrelleza
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Kubernetes comes with a lot of useful features like Volumes and StatefulSets, which make running stateful workloads simple. Interestingly, when combined with the right tools, these features can make Kubernetes very valuable for developers wanting to run massive production databases in development! This is exactly what was seen at "Extendi". The developers at Extendi deal with a large amount of data in their production Kubernetes clusters. But when developing locally, they didn't h...
2022-05-27
14 min
Data on Kubernetes Community
Is your database in Kubernetes production ready (DoK Day EU 2022) // Mykola Marzhan
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) It only looks simple to run databases in Kubernetes. In fact, many things need to be considered before running any database in Kubernetes: failover and traffic switching, replication and data consistency/loss after failover, upgrades, DB and node-level configuration, CNI, backups, monitoring, etc. After this talk, you will have a complete list of questions that should be checked before running a database in production. Mykola has been shortening “Time-to-market” in so...
2022-05-27
15 min
Data on Kubernetes Community
Growing up fast - Kubernetes and Real-Time Analytic Applications (DoK Day EU 2022) // Robert Hodges
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Kubernetes is turning into a preferred platform for real-time analytic apps that crunch billions of events per day and return insights in seconds. In this talk we'll introduce the standard analytic app design pattern of fast event streams coupled with low-latency data warehouses, using open source projects. We'll then walk through deploying the pipeline on Kubernetes from ingest to end user access. We'll touch on use of operators, scaling, monitoring, upgrade, security, and approaches t...
2022-05-27
15 min
Data on Kubernetes Community
Datashim - a framework for declarative management of datasets on Kubernetes (DoK Day EU 2022) // Srikumar Venugopal
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Many ML pipelines depend on shared filesystems for input, output and intermediate data storage. Standards such as CSI have made it possible for applications in Kubernetes to access a variety of data storage systems. Yet, data scientists still have to deal with low-level details of data access in order to execute their pipelines in Kubernetes. Datashim is a framework that manages the lifecycle of a Dataset object, a CustomResourceDefinition that represents a...
2022-05-27
15 min
Data on Kubernetes Community
Bringing Apache Cassandra closer to Kubernetes (DoK Day EU 2022) // Jake Luciani
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) What does Kubernetes provide that allows us to reduce the complexity of Apache Cassandra while making it better suited for cloud native deployments? That was the question we started with as we began a mission to bring Cassandra closer to Kubernetes and eliminate the redundancy. Many great open source databases have been adapted to run on Kubernetes, without relying on the deep ecosystem of projects that it takes to run in Kubernetes (there is a difference...
2022-05-27
09 min
Data on Kubernetes Community
Autoscaling Stateful Workloads in Kubernetes (DoK Day EU 2022) // Mohammad Fahim Abrar & Md. Kamol Hasan
https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Managing stateful workloads in a containerized environment has always been a concern. However, as Kubernetes developed, the whole community worked hard to bring stateful workloads to meet the needs of their enterprise users. As a result, Kubernetes introduced StatefulSets, which have supported stateful workloads since Kubernetes version 1.9. Users of Kubernetes can now run stateful applications like databases, AI workloads, and big data. Kubernetes support for stateful workloads comes in the form of StatefulSets. And as we all k...
2022-05-27
10 min
Data on Kubernetes Community
DoK Talks #130 - Leaning on Kubernetes Portability to Manage Databases Anywhere // Robert Hodges
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK What if databases only ran in a single place? That would be useless. But it's what we get with most database-as-a-service offerings in the public cloud. Kubernetes has the potential to change the game and let users run complex databases anywhere they want. This talk tells the story of how we built a portable ClickHouse cloud service on Kubernetes. It runs on AWS and GCP, of course. Even better, we're working on managing databases in *any* Kubernetes cluster, even those behind firewalls...
2022-05-04
1h 04
Data on Kubernetes Community
DoK Talks #126 - Automatically Instrument Kubernetes Apps with OpenTelemetry // James Blackwood-Sewell
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK The rise of Kubernetes has triggered an exponential growth of metric and trace data. This talk explores capturing and persisting telemetry data from Kubernetes apps without any specific code changes using the OpenTelemetry Operator. BIO James Blackwood-Sewell is currently a Principal Developer Advocate at TimescaleDB in Sydney. Before that he meandered through the PostgreSQL, Kubernetes and Prometheus landscapes as the Chief Architect at Jirotech. KEY TAKE-AWAYS FROM THE TALK Storing observability data from Kubernetes...
2022-04-27
1h 03
Data on Kubernetes Community
DoK Talks #128 - Getting Started with the Kubernetes Secrets Store CSI Driver // Kim Schlesinger
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK In Kubernetes, it can be difficult to keep application API keys, access tokens and passwords safe. There are several different approaches to solving this problem, and in this talk Kim will demonstrate how to install HashiCorp Vault and the Secrets Store CSI Driver so that your applications can access secrets stored in ephemeral volumes. BIO Kim is a technologist and educator, and she currently works as a Developer Advocate at DigitalOcean. Kim lives in Denver, Colorado in the USA.
2022-04-22
53 min
Data on Kubernetes Community
Dok Talks #124 - Intro to Druid on Kubernetes // Sergio Ferragut
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK This talk will provide a high-level overview of Kubernetes, Helm charts and how they can be used to deploy Apache Druid clusters of any size. We'll review how Kubernetes functionality enables resilience and self-healing, historical tiers through node group affinity, middle manager scaling through Kubernetes autoscaling to optimize ingestion capacity and some of the gotchas along the way. BIO Sergio Ferragut is a database veteran turned Developer Advocate at Imply. His experience includes 16 years at Teradata...
2022-04-08
54 min
Data on Kubernetes Community
Dok Talks #121 - Running Stateful Apps in Kubernetes Made Simple // Steve Buchanan
https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Eventually the time will come to run a stateful app in Kubernetes. It can be scary to add more moving parts to a Kubernetes cluster, and to deploy and manage your app on Kubernetes when it requires state. In this talk Steve Buchanan will take you through a journey of understanding how storage works in Kubernetes, how to persist state with pods, what storage options are available with Azure Kubernetes Service, best practices, and a demo of deploying...
2022-03-11
1h 00
Data on Kubernetes Community
Dok Talks #113 - Developing Stateful Application on Kubernetes // Rob Pacheco
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Modern web applications are typically comprised of multiple services which utilize storage in a variety of ways. Utilizing storage in Kubernetes introduces challenges that are not obvious while developing locally. We’ll introduce these challenges and show good and bad ways to handle them. We’ll dive into the details of an application along with its deployment in Kubernetes to understand why certain storage patterns are problematic and how they can be modified to behave well in a Kubernetes-based deployment.
2022-01-27
53 min
Data on Kubernetes Community
Dok Talks #112 - Production Postgres Made Easy on Kubernetes // Jonathan Katz
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK It's one thing to get a database up and running on Kubernetes; it's another to run it in production. There are many "Day 2" features to think about when running any database in production, including high availability, backups + RTO/RPO, monitoring, security, and disaster recovery. Combining Kubernetes with the Operator pattern can help make this easier, and add advanced features like self-healing, simple scaling, secure credential management, and more. PGO, the open source Postgres Operator from Crunchy Data, makes...
2022-01-23
1h 02
Data on Kubernetes Community
Dok Talks #110 - Overview of the Oracle Database Operator for Kubernetes // Paulo Simoes
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK An introduction to OraOperator, the open source project that makes Oracle Database Kubernetes-native. BIO Paulo works as a Developer Evangelist at Oracle, focusing on Microservices, Application Development with Converged Oracle Database, and Event Mesh. His current technical expertise focuses on Cloud Native architecture and best practices, including containerization and microservices development. KEY TAKE-AWAYS FROM THE TALK Oracle Database as a Kubernetes-native platform. A simplified way to operate Oracle Database.
2022-01-16
44 min
Data on Kubernetes Community
Dok Talks #109 - Benchmarking for PostgreSQL workloads in Kubernetes / Part 2 // Gabriele Bartolini
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Six months have passed since our last DoK webinar about benchmarking PostgreSQL workloads in a Kubernetes environment. In the meantime, many things have happened at EDB, and we’re happy to share what we’ve learned in this timeframe. We’ll use cnp-bench and cnp-sandbox to help us describe some of the challenges we might face when running PostgreSQL workloads, how to spot them, and what actions to take to make your databases healthier and longer-lived. cnp-bench is a collection of Hel...
2021-12-19
1h 04
Data on Kubernetes Community
Dok Talks #108 - Postgres on Kubernetes Applied at Scale in Zalando // Álvaro Hernández & Alexander Kukushkin
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Some of the topics that may or may not be discussed on this live talk could be: - Other than Patroni and the operator, what else was needed to fully switch to Kubernetes? - What is left, what is the future going to bring to the world of Kubernetes and Postgres operators? Expect strong opinions, and facts. Live. Uncensored. Postgres. On. Kubernetes. BIO Álvaro is a passionate database and software developer. He founded a...
2021-12-15
1h 02
Data on Kubernetes Community
DoK Talks #107- Cross region replication in Percona Kubernetes Operators // Ege Gunes
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK With Percona Kubernetes Operators you can migrate your data to Kubernetes or have a disaster recovery site to recover it in unfortunate times. I'll talk about the implementation, challenges and caveats of such setups and what the future may bring.
2021-12-11
45 min
Data on Kubernetes Community
DoK Talks #106- Cloud native data warehousing with Kubernetes // Mark Cusack & Matthew Ripley
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Yellowbrick's cloud data warehouse is natively integrated with Kubernetes. Not only does Kubernetes provide the elasticity, resilience and upgradeability expected of a modern data warehouse, it also enables Yellowbrick to be deployed anywhere: in public clouds, private clouds and at the network edge. During this talk, we will dive into the details of how we use Kubernetes to power our data warehouse solution, how we've created a SQL interface to Kubernetes, and give a live demo. We'll also talk about the business...
2021-12-03
59 min
Data on Kubernetes Community
DoK Talks #100- CAPE for data backup/restore on kubernetes // Sanjeev Ganjihal
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK CAPE is an open source solution offering advanced Kubernetes features for DR, data migration and mobility, multi-cluster app deployment, etc. I will demo how to install CAPE and also cover how to back up and restore data using CAPE. KEY TAKE-AWAYS Learn about a new open source tool for Kubernetes backup/restore. BIO I come from a strong infrastructure background with expertise in infrastructure automation, containers, Kubernetes, DevOps and service mesh.
2021-11-15
1h 02
Data on Kubernetes Community
DoK Talks #101- Redpanda: how to build a storage engine for kubernetes // Alexander Gallego
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK How to build a stateful, transactional streaming storage engine on top of kubernetes. KEY TAKE-AWAYS FROM THE TALK 1. Building storage engines in kubernetes is hard 2. You need tiered storage (S3 or google cloud bucket, Azure Blob) for total disaster recovery 3. You also need sound replication inside the cluster 4. You need a simple architecture so you can scale the pods easily BIO Alex Gallego is the founder and CEO of Vectorized, where...
2021-11-12
1h 01
Data on Kubernetes Community
DoK Talks #99- ETL/ELT on Kubernetes with Airbyte: K8s Development Insights // Abhi Vaidyanatha
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK ETL/ELT on Kubernetes is currently an unsolved problem. There are a lot of different approaches vying for a spot as the de facto method, but none are clear winners. Considering that the cloud-native landscape is built for deploying Dockerized, open-source software, many of the closed-source solutions fall flat and don't mesh with the trajectory of the community. Airbyte is an open-source ETL/ELT tool that harmonizes well with the cloud-native landscape and lives to enable your stateful workloads on...
2021-11-05
1h 00
Data on Kubernetes Community
DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on Kubernetes // Adheip Singh
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK This talk will introduce Druid-Operator (https://github.com/druid-io/druid-operator) and how Kubernetes and the Operator framework can be used to write an operator that enables provisioning, management, and scaling of a complex cluster of Apache Druid to 1000s of nodes. This talk will cover why Kubernetes and the operator framework are a perfect fit for managing a complex stateful application. Learnings and pitfalls faced while writing the Druid-Operator and the special considerations to be taken into account when managing and...
2021-09-30
55 min
Data on Kubernetes Community
DoK Student Sessions- Data on Kubernetes: What's a poor DBA to do? // Derek Downey
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Database Administrators (DBAs) have jobs today because someone recognized that data is critical. It's critical to businesses, who need it to make good decisions. It's critical to users, who need to know how many people liked their recent post. DBAs developed processes and maybe even automation to ensure that this critical data is available and secure, while accessing it is fast. And now someone wants them to put that critical data on Kubernetes and...
2021-09-25
1h 11
Data on Kubernetes Community
DoK Talks #90- Build Your Kubernetes Operator with the Right Tool! // Rafal Leszko
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK You want to build a Kubernetes Operator for your software. Which tool to choose? Operator SDK with Helm, Ansible, or Go? Or maybe start from scratch with Python, Java, or any other programming language? And what is the right phase in the Operator Capability/Maturity Model that you should provide? In my talk I'll present: - Different ways of building Kubernetes Operators - Demo of building the same Operator using different tools - Methods used by the...
2021-09-17
1h 03
Data on Kubernetes Community
DoK Talks #84- Build your IDP with Otomi, the first and only Kubernetes-based O.S // Sander Rodenhuis
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Kubernetes is a platform for building platforms and is not really developer-friendly. That's why Red Kubes created Otomi, the first and only Kubernetes-based operating system. Install Otomi with a single Helm chart, log in, and start deploying applications with only a few clicks. Use Otomi to build your Enterprise Container Platform or Internal Developer Platform in minutes. In this talk, I'm going to introduce Otomi. We'll start by looking at the architecture and how you can install Otomi on any Kubernetes...
2021-09-03
1h 06
Data on Kubernetes Community
DoK Talks #82- What we learned from 100 Kubernetes postmortems and how to prevent them // Shimon Tolts
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK A smart person learns from their own mistakes, but a truly wise person learns from the mistakes of others. When launching our product, we wanted to learn as much as possible about typical pains in our ecosystem, and did so by reviewing many post-mortems (100+!) to discover the recurring patterns, anti-patterns, and root causes of typical outages in Kubernetes-based systems. In this talk we have aggregated for you the insights we gathered, and in particular will review the most obvious DON’Ts and so...
2021-09-01
52 min
Data on Kubernetes Community
DoK Talks #79- Running Apache Pulsar in Kubernetes // Chris Bartholomew
https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK When I founded Kesque in 2019, my goal was to use Kubernetes as the base platform for all our software. Because Kesque was a cloud-based SaaS powered by Apache Pulsar, that meant getting Apache Pulsar, a high-performance streaming solution and Kafka alternative, up and running in Kubernetes. In this talk, I will give an overview of Apache Pulsar and describe how we got Pulsar up and running in Kubernetes. We will cover some of the features of Pulsar that make it "cloud-native" and easy...
2021-08-26
1h 06
Data on Kubernetes Community
DoK Talks #78- Leverage AI, Big Data workload on Kubernetes with Volcano // Kevin (Zefeng) Wang
ABSTRACT OF THE TALK Volcano is a system for running high-performance workloads on Kubernetes. It features powerful batch scheduling capability that Kubernetes cannot provide but is commonly required by many classes of high-performance workloads, including Machine Learning, Deep Learning, Big Data, Bioinformatics Computing, etc. These types of workloads typically run on generalized domain frameworks like TensorFlow, Spark, PyTorch, MPI, etc. In this talk, Kevin will introduce how Volcano is integrated with these frameworks to allow users to run their applications without extra adaptation efforts while enjoying remarkable batch scheduling. BIO Kevin Wang is...
2021-08-26
1h 10
Data on Kubernetes Community
DoK Talks #77- Kubernetes Essential Tools // Javier Ramos
ABSTRACT OF THE TALK In this talk I will try to summarize my favorite tools for Kubernetes, with special emphasis on the newest and lesser-known tools which I think will become very popular. BIO Javier has over 15 years of experience in the IT industry as an enterprise architect with special emphasis on Big Data and DevOps. He has worked in small startups and also large multinationals such as Oracle, IBM or Cisco. He holds several certifications in AWS, GCP and Azure and writes regularly on Medium.com. Certified Java Architect/AWS...
2021-08-25
1h 08
Data on Kubernetes Community
DoK Talks #71 Introducing Kubestr: A new way to benchmark your Kubernetes storage // Michael Cade
Abstract of the talk… Benchmarking storage is not a new concept; it has been done for a long time. But have we overlooked the benchmarking capabilities, or at least the ease with which to achieve this, in a cloud-native, container-based, Kubernetes landscape? There has been a rise in stateful workloads, and support around persistent storage in Kubernetes is improving. Now we can take our traditional workloads such as SQL Server, Oracle and SAP alongside our data stores for microservices with the same storage system for MongoDB, Cassandra, Redis, MySQL and PostgreSQL. With each of these st...
2021-08-20
1h 01
Data on Kubernetes Community
DoK Talks #76 AHOY - open source release manager for Kubernetes // Neil White
Abstract of the talk… AHOY is a release manager for Kubernetes, fully open source. Bio… I thought, this is easier explained, in, interpretive rap... Look, if you had one shot, one opportunity To fix everything that was ever broke In one deployment Would you rebuild it, or just let it fail? Yo His palms are sweaty, config weak, server load is heavy There's vomit on this keyboard already, Kubernetes He's nervous, but the control plane says Nodes are ready Fork Bomb, but he keeps on forgetting Commands he wrote down, to restart all the...
2021-08-19
1h 08
Data on Kubernetes Community
DoK Talks #75 Run production-grade Databases on Kubernetes using KubeDB // Tamal Saha
Abstract of the talk… Kubernetes has emerged as the de-facto way to deploy modern containerized apps on cloud or on-premises. Despite all that growth on the application layer, it remains challenging to run a database in a distributed environment like Kubernetes. However, many developers want to treat data infrastructure the same as application stacks. Operators want to use the same tools for databases and applications and get the same benefits as the application layer in the data layer: rapid spin-up and repeatability across environments. This is where KubeDB by AppsCode comes as a solution. KubeDB by AppsCode is...
2021-08-18
1h 08
Data on Kubernetes Community
Intro to Persistence in Kubernetes
Abstract of the talk… So you want to run stateful containers? A brief intro to persistence in Kubernetes. Kubernetes and persistent storage go together like oil and water. Kubernetes is inherently an ephemeral system and persistent storage by definition must survive. After this talk, you should have a clear understanding of how to get started on the path to successfully managing a persistent data storage solution on your Kubernetes cluster. Bio… Kunal - Junior pursuing Computer Science & Engineering. Co-founder at Code for Cause. CNCF Intern 2020. MLH Coach. Google Summer of Code Ment...
2021-08-06
57 min
Data on Kubernetes Community
DoK Talks #70 - YugabyteDB - Distributed SQL Database on Kubernetes // Amey Banarse
Abstract of the talk… Kubernetes has hit a home run for stateless workloads, but can it do the same for stateful services such as distributed databases? Before we can answer that question, we need to understand the challenges of running stateful workloads on, well anything. In this talk, we will first look at which stateful workloads, specifically databases, are ideal for running inside Kubernetes. Secondly, we will explore the various concerns around running databases in Kubernetes for production environments, such as: - The production-readiness of Kubernetes for stateful workloads in general - The pros and cons of the variou...
2021-08-04
1h 12
Data on Kubernetes Community
DoK #69- To Certify or Not to Certify, is Kubernetes Certification Worth it? // Keith McClellan
Abstract of the talk… As an engineer, should I consider getting a certification? What makes a certification valuable to me or my employer? How do I pick which one to get? Will these really help me build stateful applications on Kubernetes? In this talk, we will discuss the relative value of certifying on different technologies, with a specific focus on CNCF certifications for administration of k8s and developing Kubernetes-native applications. In this session we will discuss: - The pros and cons of getting certified - Why your current and future employers might care about your certifications - What...
2021-08-02
1h 09
Data on Kubernetes Community
DoK Talks #68- The Kubernetes-native way to providing database services to developers // Adam Sandor
Bio… Adam is a Solutions Architect at Styra, helping companies adopt Cloud Native tech. Coming from a Java-dev background he is most excited about the space where software development and operations meet. When he is not flying in DCS World he is hacking on CI/CD, GitOps and Kubernetes Operators. Abstract of the talk… As Kubernetes is becoming the infrastructure platform of choice in many companies, database teams are struggling with the question whether to run databases on the cluster or outside of it. This talk will not answer that age-old question. I woul...
2021-07-28
58 min
Data on Kubernetes Community
DoK #65 Using Kubernetes and ClickHouse to enable high performance app analytics // Robert Hodges
Abstract of the talk… Embedded analytics are a major source of value to application users. Virtually every SaaS offering has them or is adding them now. This talk shows how to build low latency analytic applications on Kubernetes with ClickHouse, a popular, open source data warehouse. We'll start with the ClickHouse Kubernetes Operator to manage data warehouses, then cover ingest and visualization options to build complete apps. Since this is a K8s talk, we'll of course geek out on the underlying plumbing as well. Bio… Robert is a database geek. He started with...
2021-07-16
1h 06
Data on Kubernetes Community
DoK #63 Stranger Danger - Kubernetes Edition // Matt Jarvis
Abstract of the talk… Kubernetes is a powerful set of abstractions, but its flexibility and configurability mean it's pretty insecure by default. In this hands-on talk, I'll show how an attacker can expand the blast radius of an exploit from a vulnerable web application in a container to owning the entire cluster. I'll also cover some ways in which you can prevent this happening to you! Bio… Matt Jarvis is a Senior Developer Advocate at Snyk. Matt has spent more than 15 years building products and services around open source software, on everything from...
2021-07-13
1h 06
Data on Kubernetes Community
DoK #62 Easy Kubernetes Volumes using Longhorn // Saiyam Pathak
Abstract of the talk… Longhorn is a lightweight, reliable, and powerful distributed block storage system for Kubernetes. It is an open source tool that can be installed on any Kubernetes Cluster. It has features like incremental snapshots and backup that can be backed up to NFS or S3-compatible object storage. In this talk, you will learn about Longhorn, its features including backup/recovery, and how you can take maximum benefit for your persistent Kubernetes volumes. You will also be shown a UI to understand the features in a much better way. Bio… Saiy...
2021-07-13
1h 10
Data on Kubernetes Community
DoK #61 Perfecting Machine Learning Workloads on Kubernetes // Lars Suanet
Abstract of the talk… More and more applications are powered by Machine Learning (ML) models. Where the gap between Software Engineers and a Production environment on Kubernetes is already big, the gap between Data Scientists and that same production environment is enormous. In this talk, we will provide you with a framework for translating ML requirements into infrastructural requirements and concrete Kubernetes resources. In the first half of this talk, we will discuss how ML applications are different from most other applications, how ML workloads are structured and how ML requirements translate into Kubernetes resource configurations. In th...
2021-07-02
1h 04
Data on Kubernetes Community
DoK #60 Intro to Kubernetes // Aitor Artola & Kunal Kushwaha
Abstract of the talk… In this event we will introduce Kubernetes, containers and the cloud native initiative. You will get an overview of the benefits of containers running on Kubernetes and the new mindset that it requires, a new mindset driven by the cultural change the cloud native initiative is promoting. Concepts related to microservices and automation will be covered, giving an overview of the different kinds of open tools you can find in the cloud native ecosystem to build and run modern applications in the cloud. Sections 1.- Introduction to Kubernetes & Cloud Native 2.- Docker & Containers. Mi...
2021-07-02
1h 46
Data on Kubernetes Community
Postgres on Kubernetes Hands-On-Lab // Álvaro Hernández
Abstract of the talk… From 0 to 60/100 (depending on where you live) in just 2h! It may sound "slow" if you talk about cars, but when you talk about databases in general and Postgres in particular, it isn't! From an empty Kubernetes cluster, you will leave the session with one or more Postgres clusters created, all with: high availability and automatic failover; automated backups with lifecycle; distributed logs with lifecycle; Web Console to manage it all; connection pooling; tuned Postgres and connection pool configurations; and any number of installed Postgres extensions. All this in just a single session! BY...
2021-07-01
2h 01
Data on Kubernetes Community
DoK #58 Benchmarking for PostgreSQL workloads in Kubernetes // Gabriele Bartolini & Francesco Canovai
Abstract of the talk… Databases like PostgreSQL cannot run on Kubernetes. That’s the refrain we hear all the time, as well as the motivation for us to break this barrier, once and for all. Hear the story of our journey so far about bringing PostgreSQL to Kubernetes. Discover why we believe that benchmarking both the storage and the database before production leads to a healthier and longer experience of the DBMS, even in Kubernetes. We’ll be sharing our process, and the results obtained so far, and unveil our plans for the future. Bio… A...
2021-06-18
1h 04
Data on Kubernetes Community
DoK #57 Key Criteria for Evaluating Kubernetes Data Storage // Enrico Signoretti
Abstract of the talk… Enterprises of all sizes are embracing hybrid cloud strategies that are ever more complex and structured, moving quickly from a first adoption phase, where data and applications are distributed manually and statically across different on-premises and cloud environments, to a new paradigm in which data and application mobility is the key to flexibility and agility. Now organizations want the freedom to choose where applications and data should run dynamically, depending on any number of business, technical, and financial factors. Kubernetes is instrumental in executing this vision, but it needs the right integration with in...
2021-06-17
1h 01
Data on Kubernetes Community
DoK #53 Day Zero - Azure Kubernetes Service // Raj Balakrishnan
Abstract of the talk… Are you new to Azure Kubernetes Service and just want to see how the nuts and bolts come together? This is the talk for you: a single slide and an end-to-end demo on how to run your first container on AKS. Bio… I contribute in building Digital Experiences for our Farmers Worldwide, through Platforms and Services. Passionate about building Distributed and Scalable Systems in Microsoft Azure. My Day to Day Work is around - Apps, Data, Storage, Identity, Integration, Serverless and Site Reliability Engineering #PlatformEngineering #AKS #Kubernetes #EventDrivenArchitecture #Micr...
2021-06-09
1h 00
Data on Kubernetes Community
DoK #52 Enterprise-grade Kubernetes requirements // Haseeb Budhani
Abstract of the talk… We'll discuss best practices companies are adopting for enterprise-grade Kubernetes management. Bio… Haseeb Budhani is the CEO of Rafay Systems, which he co-founded in late 2017. Prior to Rafay, Haseeb oversaw Akamai Technologies' expansion into the enterprise market as the company's Vice President of Enterprise Strategy. Akamai acquired Haseeb's previous company, Soha Systems, in 2016. Haseeb co-founded Soha in 2013 and served as the company's CEO. Prior to Soha, Haseeb served as the Chief Product Officer for Infineta Systems, where he was responsible for overseeing all aspects of the company's product marketing, marketing comm...
2021-06-05
1h 01
Data on Kubernetes Community
#1 DoK Community in Hindi: "Pehle Kadam Data on Kubernetes Community mein!" // Kunal Kushwaha
Abstract of the talk… What is Kubernetes? Where do you start? How do you become part of the community? Do these questions cross your mind too? Join us at this meetup, where we will talk about everything Data on K8s (in Hindi)! On May 3rd we will discuss how you can become part of the community, what CNCF is, what an SRE does, and much more! But that's not all! Take part in the quiz at the end of the meetup, where you can win some special swag DoK ki ta...
2021-05-04
1h 02
Data on Kubernetes Community
DoK Community #43 Kubecost: open source cost monitoring for Kubernetes // Webb Brown
Abstract of the talk… Measuring costs in Kubernetes environments is complex. Applications and their resource needs are often dynamic. Teams share resources without transparent prices attached to workloads, while organizations are increasingly running resources on a range of machine types and even cloud providers. Kubecost provides an approach built on open source for ensuring consistent and accurate visibility across all your workloads. This discussion will cover practical examples for implementing cost monitoring & optimization and managing the data that is generated from these efforts. Bio… Webb Brown is a Co-founder and CEO at Kube...
2021-04-25
1h 02
Data on Kubernetes Community
DoK Community #42 Spark on Kubernetes is Now Generally Available: Why & How to Migrate to It // Jean-Yves Stephan
Abstract of the talk… Apache Spark has run natively on top of Kubernetes (instead of Hadoop YARN) since 2018, but only since Spark 3.1 (released in March 2021) has the integration been officially generally available & production-ready. What is the high-level architecture of Spark on Kubernetes, how does it compare to alternatives, what does the migration look like? These are some of the questions we will answer together. We will first introduce the core concepts, then go through the stories of customers who migrated, and then give you concrete technical tips to help you be successful with Spark (on Kubernetes). If...
2021-04-23
1h 03
Data on Kubernetes Community
DoK Community #41 Designing Stateful Apps for the Cloud and Kubernetes // Evan Chan
Abstract of the talk… Almost all applications have some kind of state. Some data processing apps and databases have huge amounts of state. How do we navigate a cloud-based world of containers where stateless and functions-as-a-service are all the rage? As a long-time architect, designer, and developer of very stateful apps (databases and data processing apps), I’d like to take you on a journey through the modern cloud world and Kubernetes, offering helpful design patterns, considerations, tips, and where things are going. How is Kubernetes shaking up stateful app design? - What kind of state is there, a...
2021-04-20
1h 01
Data on Kubernetes Community
#38 DoK Community: Patterns to create stateful applications on Kubernetes // Prashant Ghildiyal
Abstract of the talk… In this talk we will discuss the best patterns to create stateful applications on top of Kubernetes. This will include application layer caching, embeddable databases, as well as leveraging Kubernetes objects to store and sync state across multiple replicas. Bio… Prashant is passionate about democratizing best AppOps practices for Kubernetes and has started Devtron, an open source platform for AppOps on top of Kubernetes. The recently open-sourced Devtron has been instrumental in the adoption of Kubernetes in some of the largest startups in India.
2021-04-08
1h 10
Data on Kubernetes Community
#29 DoK Community: How Absa Developed Cloud Native Global Load Balancer for Kubernetes // Yury Tsarev
Abstract of the talk… Global load balancing solutions, commonly referred to as GSLB (Global Server Load Balancing), have typically been the domain of proprietary network software and hardware vendors, installed and managed by siloed network teams. k8gb is a completely open source, cloud native, global load balancing solution for Kubernetes. k8gb focuses on load balancing traffic across geographically dispersed Kubernetes clusters using multiple load balancing strategies to meet requirements such as region failover for high availability. Global load balancing for any Kubernetes Service can now be enabled and managed by any operations or development te...
2021-03-27
54 min
Data on Kubernetes Community
DoK en español #1- Nuestros aprendizajes con Kubernetes // Aitor Artola, Miriam González, Raquel López Ruiz e Isidro Nistal
Our learnings from Kubernetes
2021-03-27
1h 06
Data on Kubernetes Community
#35 DoK Community: Make Kubernetes your development environment // Ramiro Berrelleza
https://go.dok.community/slack Abstract of the talk… Developers spend a lot of time making their local machine look like a cluster. But why do we do that? Our local machine is not where our code is supposed to run! We built okteto (github.com/okteto/okteto) so we can make our Kubernetes clusters look like our local machine. In this talk, we'll show you how okteto helps you take advantage of all the goodness of Kubernetes and the cloud without having to sacrifice a really fast development and feedback loop. Bio… ...
2021-03-18
1h 02
Data on Kubernetes Community
#32 DoK Community: How to choose a Kubernetes distribution for on-prem environments? // Tomasz Cholewa (Presenter: Bart Farrell)
Abstract of the talk… Buy a ready off-the-shelf product, customize an existing open source project, or build your own distribution? When you can't go to the cloud and leverage its powerful features you have to make a choice. On-prem environments need more attention, but they also often can be more cost-effective and are highly coveted by the development and operations teams. In this talk, I will cover some of the most important topics related to building an on-prem Kubernetes platform and I will describe the most popular distributions. Bio… Tomasz Chol...
2021-03-07
1h 13
Data on Kubernetes Community
#30 DoK Community: Kyverno for Kubernetes! // Jim Bugwadia. (Presenter: Bart Farrell)
Abstract of the talk… Kubernetes is powerful but can be complex to manage! In this talk, Jim Bugwadia from Nirmata will show how policy managers can help address the complexity via admission controls and dynamic configurations. Jim will introduce Kyverno, a Kubernetes native policy engine and CNCF sandbox project. Jim will then demonstrate how you can use Kyverno to ensure security and best practice compliance for your clusters. Bio… Software engineer with background in distributed systems and scale computing. Founder at Nirmata. Previously worked at Cisco, Pano Logic, Trapeze Networks, Bell Labs, and Moto...
2021-02-24
1h 03
Data on Kubernetes Community
#28 DoK Community: Getting Started Contributing to Kubernetes // Rin Oliver & Savitha Raghunathan. (Presenter: Bart Farrell)
https://go.dok.community/slack Abstract of the talk… This talk will walk through how to get started contributing to Kubernetes, combatting imposter syndrome, the many other ways you can get started contributing to K8s other than by writing code, and the benefits to joining a community such as K8s. Bio… Rin is a Technical Community Builder at Camunda. They enjoy discussing all things open source, with a particular focus on diversity in tech, improving hiring pipelines in OSS for those that are neur...
2021-02-11
56 min
Data on Kubernetes Community
DoK Nederkube Edition #1: Is Kubernetes ready for Data Management? // Michel de Ru, Jeffry Molanus & Arie van den Bos
Abstract of the talk… Kubernetes has become the standard for microservices architectures. But what about handling massive and scalable data management on top of it? Is it possible and what does it mean for operations? Cassandra has been adopted widely and accepted globally as the most scalable and reliable database. Now it adds ease of use by offering a Kubernetes-native plug-and-play solution for enterprise use! Bios… Michel has been specializing in NoSQL technologies and Distributed Data Management for about 10 years. Working with enterprise organizations like ABN AMRO, Rabo...
2021-01-30
1h 00
Data on Kubernetes Community
#22 DoK community: Vitess Operator for Kubernetes // Alkin Tezuysal
In this talk, I would like to uncover our newly announced Vitess Operator for Kubernetes. This talk demonstrates the sample implementation of Vitess in Kubernetes topology. I also explore common DBA tasks by demonstrating how they are handled in the Vitess ecosystem. Vitess, out of the box, comes with a lot of tools and utilities that one has to either incorporate or develop to manage MySQL topology. Let’s take a look at the capabilities of Vitess in these areas and demonstrate how they are performed under the operator realm. Alkin Tezuysal has extensive ex...
2020-12-15
58 min
Data on Kubernetes Community
#21 DoK community: Data on Kubernetes: my insights // Eduard Tomàs
Data handling is one of the hardest things in Kubernetes. This talk will be an informal conversation about things (related to data management) Eduard found while helping customers to embrace Kubernetes. I hope you find them useful! Eduard Tomàs has been a passionate developer since he was a child. He started programming with a Spectrum 48 and he is still here! Now he is focused on helping customers to embrace the cloud. Because of that, he works with cloud-ready, cloud-native technologies and distributed architectures.
2020-12-09
1h 01
Data on Kubernetes Community
#20 DoK community: Tips and tricks to get Kubernetes certifications // Eneko Pérez & Carlos Gómez Carrero
CKA (Certified Kubernetes Administrator) has a bad reputation as the hardest certification many people have faced. In this talk, we will go through the process of successfully passing the exam, tips on the exam itself, the environment, and any other questions that might arise. How to fly into a Kubernetes certification. Eneko Pérez is a DevOps Engineer evolved from many years as a systems administrator. Carlos Gómez is a Kubernetes (CKA | CKAD) | Solutions Architect | DevOps Architect trying to find his place in the Cloud
2020-12-01
56 min
Data on Kubernetes Community
#16 DoK community: HyperStore-C: S3 object storage managed by Kubernetes // Gary Ogasawara
Cloudian’s HyperStore is S3-compatible object storage software focused on the enterprise market. In this talk, I'll discuss how and why we are working on Kubernetes-managed versions of HyperStore, including where we are now and what we're looking. Gary Ogasawara is Cloudian’s first Chief Technology Officer, responsible for setting the company’s long-term technology vision and direction. Before assuming this role, he was Cloudian’s founding engineering leader. Prior to Cloudian, Gary led the Engineering team at eCentives, a search engine company. He also led the development of real-time commerce and advertising systems at Inktomi, an Intern...
2020-11-30
55 min
Data on Kubernetes Community
DoK Season 1 Extras - #2 - The move from Pivotal Cloud Foundry to Kubernetes // Alexander Herranz
In this episode, Alexander Herranz talks about where companies keep their data, by way of a comparison between OpenShift and Kubernetes. Some of the topics we cover: The move from Pivotal Cloud Foundry to Kubernetes, and on to OpenShift. Advantages and disadvantages of starting to work with containers under the OpenShift umbrella. A way to ensure good practices in the development process: continuous integration and deployment (CI/CD), monitoring, and security.
2020-11-05
31 min
Data on Kubernetes Community
#14 DoK community: Kubernetes Cost Control // Arie van den Bos
For our 14th installment of the data on k8s community meetup, we talked with Cloud System Engineer / Architect Arie van den Bos. // Abstract: In this meetup, Arie discussed the following: The importance of cost control while working with the cloud. K8S, Data & Cost Control. Hints/Tips around controlling your K8S costs. // Bio: Arie is a Cloud Solutions Engineer with over 10 years of experience. He has been a team leader and helped previous companies reduce their cloud costs through optimization strategies. As a Solution Architect, Arie has been d...
2020-10-26
51 min
Data on Kubernetes Community
#13 DoK community: Distributed Workloads on Kubernetes Operators to the Rescue // Sebastien Guilloux
For our 13th installment of the data on k8s meetup, we will be talking with Senior Software Engineer Sebastien Guilloux from Elastic about distributed workloads on k8s and how operators play a part in that! // Abstract: How easily can you run distributed workloads on Kubernetes? The initial deployment of your 10-node database might be easy to set up, but day-2 operations (changing the configuration, adding and removing nodes, version upgrades, etc.) are much more complicated. We'll discuss how operators can help you manage distributed workloads, and a few operator tricks we learned while...
2020-10-14
58 min
Data on Kubernetes Community
#10 DoK community: Data on Kubernetes and container attached storage - an update // Evan Powell
For our 10th installment of the data on k8s community meetup, we talk with CEO of MayaData Evan Powell about container attached storage, the Portworx acquisition, OpenEBS, whether open source can make it, and we geek out a bit with questions from the audience. // Key takeaways: Kubernetes - extended by CAS and other software - is not just solving the problems of running data on Kubernetes, it is fast becoming the preferred platform for data. // Abstract: Back in 2018 the CNCF published a blog we wrote called Container Attached Storage. Today - September 22...
2020-09-24
58 min
Data on Kubernetes Community
#7 DoK community: Conway’s Law & Kubernetes: Centralization vs. small team autonomy // Joseph Sandoval & Mike Tougeron
Data on Kubernetes #7: Conway’s Law & Kubernetes - Centralization vs small team autonomy with Mike Tougeron, Lead Site Reliability Engineer at Adobe, & Joseph Sandoval, SRE Manager, Platform Infrastructure, at Adobe. Loosely coupled teams, loosely coupled workloads and loosely coupled data - on a built-for-everyone platform? Abstract: Big clusters or small clusters? Where to draw the line and how to know what's best for your use case? We speak with Joseph and Mike from Adobe about the inevitable questions that arise when running k8s at scale. If it is run by t...
2020-09-03
56 min
Data on Kubernetes Community
#3 DoK community: Design considerations for operationalizing Distributed SQL on Kubernetes // Nikhil Chandrappa
Distributed databases on Kubernetes. And we just keep rolling along! Round 3 of the data on Kubernetes community meetup! This time we will be talking with Nikhil Chandrappa, Lead Software Engineer at YugabyteDB. We will take a practical look at running distributed SQL on Kubernetes using YugabyteDB. Key takeaways: - Introduction to YugabyteDB Distributed SQL databases and its design principles - Design considerations for operationalizing Distributed SQL on Kubernetes - Deployment strategies for clustered Databases - Storage orchestration on Kubernetes - Yugabyte's approach for DBaaS on Kubernetes - DB Creation, Scale up / Scale down - Implementing Da...
2020-08-06
58 min