Pat Helland | Scalable OLTP in the Cloud: What’s the BIG DEAL? | #50
Disseminate: The Computer Science Research Podcast
by Jack Waudby
22h ago
In this thought-provoking podcast episode, we dive into the world of scalable OLTP (OnLine Transaction Processing) systems with Pat Helland. A seasoned expert in the field, Pat shares his insights on the critical role of isolation semantics in the scalability of OLTP systems, emphasizing their significance as the "BIG DEAL." By examining the interface between OLTP databases and applications, particularly through the lens of RCSI (READ COMMITTED SNAPSHOT ISOLATION) SQL databases, Pat talks about the limitations imposed by current database architectures and application patterns o ..read more
Rui Liu | Towards Resource-adaptive Query Execution in Cloud Native Databases | #49
Disseminate: The Computer Science Research Podcast
by Jack Waudby
1w ago
In this episode, we talk to Rui Liu and explore the transformative potential of Ratchet, a groundbreaking resource-adaptive query execution framework. We delve into the challenges posed by ephemeral resources in modern cloud environments and the innovative solutions offered by Ratchet. Rui guides us through the intricacies of Ratchet's design, highlighting its ability to enable adaptive query suspension and resumption, sophisticated resource arbitration for diverse workloads, and a fine-grained pricing model to navigate fluctuating resource availability. Join us as we uncover the future of clo ..read more
Yifei Yang | Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries | #48
Disseminate: The Computer Science Research Podcast
by Jack Waudby
1M ago
In this episode, Yifei Yang introduces predicate transfer, a revolutionary method for optimizing join performance in databases. Predicate transfer builds on Bloom joins, extending their benefits to multi-table joins. Inspired by Yannakakis's theoretical insights, predicate transfer leverages Bloom filters to achieve significant speed improvements. Yang's evaluation shows an average 3.3× performance boost over Bloom join on the TPC-H benchmark, highlighting the potential of predicate transfer to revolutionize database query optimization. Join us as we explore the transformative impact of predicat ..read more
Vikramank Singh | Panda: Performance Debugging for Databases using LLM Agents | #47
Disseminate: The Computer Science Research Podcast
by Jack Waudby
1M ago
In this episode, Vikramank Singh introduces the Panda framework, aimed at refining Large Language Models' (LLMs) capability to address database performance issues. Vikramank elaborates on Panda's four components—Grounding, Verification, Affordance, and Feedback—illustrating how they collaborate to contextualize LLM responses and deliver actionable recommendations. By bridging the divide between technical knowledge and practical troubleshooting needs, Panda has the potential to revolutionize database debugging practices, offering a promising avenue for more effective and efficient resolution of ..read more
Tamer Eldeeb | Chablis: Fast and General Transactions in Geo-Distributed Systems | #46
Disseminate: The Computer Science Research Podcast
by Jack Waudby
2M ago
In this episode, Tamer Eldeeb sheds light on the challenges faced by geo-distributed database management systems (DBMSes) in supporting strictly-serializable transactions across multiple regions. He discusses the compromises often made between low-latency regional writes and restricted programming models in existing DBMS solutions. Tamer introduces Chablis, a groundbreaking geo-distributed, multi-versioned transactional key-value store designed to overcome these limitations. Chablis offers a general interface accommodating range and point reads, along with writes within multi-step strictly-ser ..read more
Matt Butrovich | Tigger: A Database Proxy That Bounces With User-Bypass | #45
Disseminate: The Computer Science Research Podcast
by Jack Waudby
4M ago
Summary: In this episode, we chat to Matt Butrovich about his research on database proxies. We discuss the inefficiencies of traditional database proxies, which operate in user-space, causing overhead due to buffer copying and system calls. Matt introduces "user-bypass" which leverages Linux's eBPF infrastructure to move application logic into kernel-space. Matt then tells us about Tigger, a PostgreSQL-compatible DBMS proxy, showcasing user-bypass benefits. Tune in to hear about the experiments that demonstrate how Tigger can achieve up to a 29% reduction in transaction latencies and a 42% re ..read more
Gábor Szárnyas | The LDBC Social Network Benchmark: Business Intelligence Workload | #44
Disseminate: The Computer Science Research Podcast
by Jack Waudby
4M ago
Summary: In this episode, Gábor Szárnyas takes us on a journey through the LDBC Social Network Benchmark's Business Intelligence workload (SNB BI). Developed through collaboration between academia and industry, the SNB BI is a comprehensive graph OLAP benchmark. It pushes the boundaries of synthetic and scalable analytical database benchmarks, featuring a sophisticated data generator and a temporal graph with small-world phenomena. The benchmark's query workload, rooted in LDBC's innovative design methodology, aims to drive future technical advancements in graph database systems. Gábor highligh ..read more
Thaleia Doudali | Is Machine Learning Necessary for Cloud Resource Usage Forecasting? | #43
Disseminate: The Computer Science Research Podcast
by Jack Waudby
5M ago
Summary: In this week's episode, we talk with Thaleia Doudali and explore the realm of cloud resource forecasting, focusing on the use of Long Short-Term Memory (LSTM) neural networks, a popular machine learning model. Drawing from her research, Thaleia discusses the surprising discovery that, despite the complexity of ML models, accurate predictions often boil down to a simple shift of values by one time step. The discussion explores the nuances of time series data, encompassing resource metrics like CPU, memory, network, and disk I/O across different cloud providers and levels. Thaleia high ..read more
Jinkun Geng | Nezha: Deployable and High-Performance Consensus Using Synchronized Clocks | #42
Disseminate: The Computer Science Research Podcast
by Jack Waudby
6M ago
Summary: In this episode, Jinkun Geng talks to us about Nezha, a high-performance consensus protocol. Nezha can be deployed by cloud tenants without support from cloud providers. Nezha bridges the gap between protocols such as MultiPaxos and Raft, which can be readily deployed, and protocols such as NOPaxos and Speculative Paxos, which provide better performance but require access to technologies such as programmable switches and in-network prioritization, which cloud tenants do not have. Tune in to learn more! ..read more
Dimitris Koutsoukos | NVM: Is it Not Very Meaningful for Databases? | #41
Disseminate: The Computer Science Research Podcast
by Jack Waudby
6M ago
Summary: In this episode, Dimitris Koutsoukos talks to us about Persistent or Non-Volatile Memory (PMEM) and we answer the question: Is it Not Very Meaningful for Databases? PMEM offers expanded memory capacity and faster access to persistent storage. However, before Dimitris's work there was no comprehensive empirical analysis of existing database engines under different PMEM modes to understand how databases can benefit from the various hardware configurations. Dimitris and his colleagues then analyzed multiple different engines under common benchmarks with PMEM in AppDirect mo ..read more