How to Build a Recommender System using Rockset and OpenAI Embedding Models
Rockset Blog
by Ankit Khare
2w ago
Overview In this guide, you will: Gain a high-level understanding of vectors, embeddings, vector search, and vector databases, which will clarify the concepts we will build upon. Learn how to use the Rockset console with OpenAI embeddings to perform vector-similarity searches, forming the backbone of our recommender engine. Build a dynamic web application using vanilla CSS, HTML, JavaScript, and Flask, seamlessly integrating with the Rockset API and the OpenAI API. Find an end-to-end Colab notebook that you can run without any dependencies on your local operating system: Recsys_workshop. Int ..read more
Visit website
How We Optimized Rockset's Hot Storage Tier to Improve Efficiency By More Than 200%
Rockset Blog
by Rafael Kabesa
1M ago
The speed and scalability of data used in applications, which pairs closely with its cost, are critical components every development team cares about. This blog describes how we optimized Rockset’s hot storage tier to improve efficiency by more than 200%. We delve into how we architect for efficiency by leveraging new hardware, maximizing the use of available storage, implementing better orchestration techniques and using snapshots for data durability. With these efficiency gains, we were able to reduce costs while keeping the same performance and pass along the savings to users. Rockset’s new ..read more
Visit website
Build AI-powered Recommendations with Confluent Cloud for Apache Flink® and Rockset
Rockset Blog
by Julie Mills
1M ago
Today, Confluent announced the general availability of its serverless Apache Flink service. Flink is one of the most popular stream processing technologies, ranked as a top five Apache project and backed by a diverse committer community including Alibaba and Apple. It powers steam processing at many companies including Uber, Netflix, and Linkedin. Rockset customers using Flink often share how challenging it is to self-manage Flink for streaming transformations. That’s why we’re thrilled that Confluent Cloud is making it easier to use Flink, providing efficient and performant stream processing ..read more
Visit website
Profiling Individual Queries in a Concurrent System
Rockset Blog
by Nathan Bronson
1M ago
A good CPU profiler is worth its weight in gold. Measuring performance in-situ usually means using a sampling profile. They provide a lot of information while having very low overhead. In a concurrent system, however, it is hard to use the resulting data to extract high-level insights. Samples don’t include context like query IDs and application-level statistics; they show you what code was run, but not why. This blog introduces trampoline histories, a technique Rockset has developed to efficiently attach application-level information (query IDs) to the samples of a CPU profile. This lets us u ..read more
Visit website
Understanding DynamoDB Secondary Indexes
Rockset Blog
by Alex DeBrie
2M ago
Introduction Indexes are a crucial part of proper data modeling for all databases, and DynamoDB is no exception. DynamoDB's secondary indexes are a powerful tool for enabling new access patterns for your data. In this post, we'll look at DynamoDB secondary indexes. First, we'll start with some conceptual points about how to think about DynamoDB and the problems that secondary indexes solve. Then, we'll look at some practical tips for using secondary indexes effectively. Finally, we'll close with some thoughts on when you should use secondary indexes and when you should look for other solutions ..read more
Visit website
How Klarna Scales Buy Now Pay Later with Real-Time Anomaly Detection
Rockset Blog
by Julie Mills
2M ago
Klarna is a leading buy-now-pay-later company, giving shoppers more time to pay while paying merchants in full upfront. With a number of payment options, including direct payments, pay after delivery and installment plans, Klarna provides shoppers flexibility in how they pay with zero interest. The number of new payment options helps over 500k merchants using Klarna to attract, convert and retain global shoppers. Klarna integrates seamlessly into the payment experience offering one-click purchases, regardless of the payment plan. The flexible options enable shoppers to make larger purchases re ..read more
Visit website
Rockset Ushers in the New Era of Search and AI with a 30% Lower Price
Rockset Blog
by Julie Mills
3M ago
In 2023, Rockset announced a new cloud architecture for search and analytics that separates compute-storage and compute-compute. With this architecture, users can separate ingestion compute from query compute, all while accessing the same real-time data. This is a game changer in disaggregated, real-time architectures. It also unlocks ways to make it easier and cheaper to build applications on Rockset. Today, Rockset releases new features that make search and analytics more affordable than ever before: General purpose instance class: A new ratio of compute and memory resources that is suitabl ..read more
Visit website
How to Update Documents in Elasticsearch
Rockset Blog
by Shawn Adams
3M ago
Elasticsearch is an open-source search and analytics engine based on Apache Lucene. When building applications on change data capture (CDC) data using Elasticsearch, you’ll want to architect the system to handle frequent updates or modifications to the existing documents in an index. In this blog, we’ll walk through the different options available for updates including full updates, partial updates and scripted updates. We’ll also discuss what happens under the hood in Elasticsearch when modifying a document and how frequent updates impact CPU utilization in the system. Example application wit ..read more
Visit website
Mutable Data in Rockset
Rockset Blog
by Luka Lovosevic
4M ago
Data mutability is the ability of a database to support mutations (updates and deletes) to the data that’s stored inside it. It’s a critical feature, especially in real-time analytics where data constantly changes and you need to present the latest version of that data to your customers and end users. Data can arrive late, it can be out of order, it can be incomplete or you might have a scenario where you need to enrich and extend your datasets with additional information for them to be complete. In either case, the ability to change your data is very important. Rockset is fully mutable Rocks ..read more
Visit website
Choosing Between Nested Queries and Parent-Child Relationships in Elasticsearch
Rockset Blog
by Julie Mills
5M ago
Data modeling in Elasticsearch is not as obvious as it is when dealing with relational databases. Unlike traditional relational databases that rely on data normalization and SQL joins, Elasticsearch requires alternative approaches for managing relationships. There are four common workarounds to managing relationships in Elasticsearch: Application-side joins Data denormalization Nested field types and nested queries Parent-child relationships In this blog, we’ll discuss how you can design your data model to handle relationships using the nested field type and parent-child relationships. We’ll ..read more
Visit website

Follow Rockset Blog on FeedSpot

Continue with Google
Continue with Apple
OR