Load balancing OpenAI API calls with LiteLLM
baeke.info
by Geert Baeke
1w ago
If you have ever created an application that makes calls to Azure OpenAI models, you know there are limits to the amount of calls you can make per minute. Take a look at the settings of a GPT model below: GPT deployment settings Above, the tokens per minute (TPM) rate limit is set to 60 000 tokens. This translates to about 360 requests per minute. When you exceed these limits, you get 429 Too Many Requests errors. There are many ways to deal with these limits. A few of the main ones are listed below: You can ask for a PAYGO quota increase: remember that high quotas do not necessarily lead to ..read more
Visit website
Use Azure OpenAI on your data with Semantic Kernel
baeke.info
by Geert Baeke
1M ago
I have written before about Azure OpenAI on your data. For a refresher, see Microsoft Learn. In short, Azure OpenAI on your data tries to make it easy to create an Azure AI Search index that supports advanced search mechanisms like vector search, potentially enhanced with semantic reranking. On of the things you can do is simply upload your documents and start asking questions about these documents, right from within the Azure OpenAI Chat playground. The screenshot below shows the starting screen of a step-by-step wizard to get your documents into an index: Upload your documents to Azure OpenA ..read more
Visit website
So you want a chat bot to talk to your SharePoint data?
baeke.info
by Geert Baeke
1M ago
It’s a common request we hear from clients: “We want a chatbot that can interact with our data in SharePoint!” The idea is compelling – instead of relying on traditional search methods or sifting through hundreds of pages and documents, users could simply ask the bot a question and receive an instant, accurate answer. It promises to be a much more efficient and user-friendly experience. The appeal is clear: Improved user experience Time savings Increased productivity But how easy is ..read more
Visit website
Embedding flows created with Microsoft Prompt Flow in your own applications
baeke.info
by Geert Baeke
2M ago
A while ago, I wrote about creating your first Prompt Flow in Visual Studio Code. In this post, we will embed such a flow in a Python application built with Streamlit. The application allows you to search for images based on a description. Check the screenshot below: Streamlit app to search for images based on a description There are a few things we need to make this work: An index in Azure AI Search that contains descriptions of images, a vector of these descriptions and a link to the image A flow in Prompt Flow that takes a description as input and returns the image link or the entire image ..read more
Visit website
Super fast bot creation with Copilot Studio and the Azure OpenAI Assistants API
baeke.info
by Geert Baeke
2M ago
In a previous post, I discussed the Microsoft Bot Framework SDK that provides a fast track to deploying intelligent bots with the help of the Assistants API. Yet, the journey doesn’t stop there. Copilot Studio, a low-code tool, introduces an even more efficient approach, eliminating the need for intricate bot coding. It empowers developers to quickly design and deploy bots, focusing on functionality over coding complexities. In this post, we will combine Copilot Studio with the Assistants API. But first, let’s take a quick look at the basics of Copilot Studio. Copilot Studio Copilot Studio, kn ..read more
Visit website
Fast chat bot creation with the OpenAI Assistants API and the Microsoft Bot Framework SDK
baeke.info
by Geert Baeke
2M ago
This post is part of a series of blog posts about the Azure OpenAI Assistants API. Here are the previous posts: Part 1: introduction Part 2: using tools Part 3: retrieval In all of those posts, we demonstrated the abilities of the Azure OpenAI Assistants API in a Python notebook. In this post, we will build an actual chat application with some help of the Bot Framework SDK. The Bot Framework SDK is a collection of libraries and tools that let you build, test and deploy bot applications. The target audience is developers. They can write the bot in C#, TypeScript or Python. If you are more of ..read more
Visit website
Retrieval with the Azure OpenAI Assistants API
baeke.info
by Geert Baeke
2M ago
In two previous blog posts, I wrote an introduction to the Azure OpenAI Assistants API and how to work with custom functions. In this post, we will take a look at an assistant that can answer questions about documents. We will create an HR Assistant that has access to an HR policy document. In addition, we will provide a custom function that employees can use to request a raise. Retrieval The OpenAI Assistants API (not the one in Azure) supports a retrieval tool. You can simply upload one or more documents, turn on retrieval and you are good to go. The screenshot below shows the experience on ..read more
Visit website
Using tools with the Azure OpenAI Assistants API
baeke.info
by Geert Baeke
2M ago
Introduction In a previous blog post, I wrote an introduction about the Azure OpenAI Assistants API. As an example, I created an assistant that had access to the Code Interpreter tool. You can find the code here. In this post, we will provide the assistant with custom tools. These custom tools use the function calling features of more recent GPT models. As a result, these custom tools are called functions in the Assistants API. What’s in a name right? There are a couple of steps you need to take for this to work: Create an assistant and give it a name and instructions. Define one or more func ..read more
Visit website
A look at the Azure OpenAI Assistants API
baeke.info
by Geert Baeke
2M ago
Introduction A while ago, I looked at the OpenAI Assistants API. In February of 2024, Microsoft have released their Assistants API in public preview. It works in the same way as the OpenAI Assistants API while being able to use it with Azure OpenAI models, deployed to a region of your choice. The goal of the Assistants API is to make it easier for developers to create applications with copilot-like experiences. It should be easier to provide the assistant with extra knowledge or allow the assistant to interact with the world by calling external APIs. If you have ever created a chat-based copil ..read more
Visit website
Deploy a flow created in Prompt Flow with Docker
baeke.info
by Geert Baeke
4M ago
In the previous post, we created a flow with Prompt Flow in Visual Studio Code. The Prompt Flow extension for VS Code has a visual flow editor to test the flow. You simply provide the input and click the Run button. When the flow is finished, the result can be seen in the Outputs node, including a trace of the flow: Running a flow in Prompt Flow Now it’s time to deploy the flow. One of the options is creating a container image with Docker. Before we start, we will first convert this flow into a chat flow. Chat does not make much sense for this flow. However, the Docker container includes a UI ..read more
Visit website

Follow baeke.info on FeedSpot

Continue with Google
Continue with Apple
OR