Data on Kubernetes Community Blog
134 FOLLOWERS
After discussions with thousands of companies and individuals running data workloads on Kubernetes we've come to see that there is a need for a sharing of patterns and concerns about how to build and operate data-centric applications on Kubernetes. As a result, the Data on Kubernetes Community (DoKC) was born. Follow our blog where technologists share their stories, wisdom, and practical..
Data on Kubernetes Community Blog
2d ago
This talk was an under-the-hood look at developing an open source DBaaS. We looked at the components that are used for its backend services, frontend, and the inner workings of...
The post Anatomy of a DBaaS: Bringing self-serve databases to Kubernetes with open source first appeared on Data on Kubernetes Community.
The post Anatomy of a DBaaS: Bringing self-serve databases to Kubernetes with open source appeared first on Data on Kubernetes Community ..read more
Data on Kubernetes Community Blog
1M ago
The Data on Kubernetes (DoK) Community is conducting its annual survey to understand the technology issues and trends for running data and stateful workloads on Kubernetes. This year’s survey will focus on DoK ecosystem technologies and AI/ML workloads. We invite you to be part of this year's survey and contribute to the collective knowledge of our community.
Our previous reports have revealed key insights from business executives and technology leaders using DoK. The inaugural report found that 90% of respondents believed Kubernetes was ready for stateful workloads, with 70% already running ..read more
Data on Kubernetes Community Blog
1M ago
We are thrilled to announce that Lori Lorusso, Head of Community at Percona, has been appointed as co-chair for the Data on Kubernetes Special Interest Group (SIG). This appointment marks an exciting new chapter for our community, and we're confident that Lori's extensive experience and passion will drive significant advancements in our mission.
Lori brings a wealth of expertise in community building, developer relations, and open-source leadership to this role. Her impressive track record includes:
2023 Marketing Chair of the Cloud Native Computing Foundation (CNCF)
2022-2024 Outreach Comm ..read more
Data on Kubernetes Community Blog
1M ago
For years, Apache Kafka relied on Apache ZooKeeper for maintaining its metadata and coordination. But that is coming to an end. After a lot of work in the Apache Kafka...
The post Released From the Cage: Apache Kafka Without Its ZooKeeper first appeared on Data on Kubernetes Community.
The post Released From the Cage: Apache Kafka Without Its ZooKeeper appeared first on Data on Kubernetes Community ..read more
Data on Kubernetes Community Blog
1M ago
This talk both described and demonstrated how stateful applications, including GPU-accelerated AI/ML workflows, can be automatically hot restarted after pod kill/eviction events that are augmented by transparent memory-snapshotting techniques. Applications...
The post Lightning Talk: Enabling Hot Restart of Stateful Applications Including GPU-Accelerate AI/ML Workloads first appeared on Data on Kubernetes Community.
The post Lightning Talk: Enabling Hot Restart of Stateful Applications Including GPU-Accelerate AI/ML Workloads appeared first on Data on Kubernetes Community ..read more
Data on Kubernetes Community Blog
2M ago
Data on Kubernetes and stateful applications have gained remarkable adoption across the community. But why stop there? Kubernetes and cloud-native tools can provide compelling core technologies for building sophisticated data ecosystems, from advanced metadata handling to workflows and events. Enter the realm of “dataspaces,” a transformative concept empowering organizations to seamlessly integrate and synchronize data sharing patterns for diverse existing data landscapes, that can even extend across organizational boundaries. Our session gave practical examples how Kubernetes and open source ..read more
Data on Kubernetes Community Blog
2M ago
There’s not much doubt that databases now run well on Kubernetes: operators have matured, storage management works, and there are lots of success stories. What do you do now? Build your own data platform to replace expensive, proprietary cloud services! Argo CD and Flux make it possible to integrate databases, data ingest, visualization, integration, and operations into a single platform that deploys from GitHub or GitLab. Our talk reviewed open source projects for data platforms configuration as well as standard design patterns for applying them in real systems.
Speakers:
Robert Hodges – CEO ..read more
Data on Kubernetes Community Blog
2M ago
Efficient data handling traditionally involves constructing robust pipelines to process information from diverse sources. However, recent open-source tools question this approach and propose an alternative: rather than detailing data processing steps, why not focus on the relationships between data objects?
This gave rise to the concept of Assets—entities like SQL tables, Parquet files, or S3 objects. Instead of defining the pipeline that creates an entity, the focus shifts to specifying how various assets interconnect and highlighting their relationships.
By using a Cloud Native orchestrator ..read more
Data on Kubernetes Community Blog
2M ago
We put out the call for Data on Kubernetes community members to submit applications for the Ambassador program, and you responded. We had an overwhelming response of amazing candidates. Several existing ambassadors are returning and will be joined by a roster of new DoK experts. The DoK Ambassador program started a year ago as a way to acknowledge the leaders in the DoK Community. These are individuals who have a thirst for sharing their knowledge and helping others in the community. This year is no different. The adoption of running data workloads on K8s has continued to grow and, with that ..read more
Data on Kubernetes Community Blog
2M ago
Unleash PostgreSQL’s potential in Kubernetes with CloudNativePG, a community-driven control plane reshaping the database landscape. Join a dedicated CloudNativePG maintainer and active Postgres contributor on a captivating journey through managing highly available clusters in the Cloud Native era. Discover best practices for large-scale databases: architecture, deployment on bare metal or virtual machines in Kubernetes, storage optimization, robust backup, recovery strategies, vertical scalability, and performance tuning. Gain insights into real-world challenges and battle-tested solutions for ..read more