Apple MLX Vision LLM Server with Ngrok, FastAPI and Sparrow
Andrejus Baranovskis's Blog
by Andrej Baranovskij
3d ago
I show how I run Apple MLX backend on my local Mac Mini M4 Pro 64GB and access it from the Web through Ngrok, with automatically provisioned HTTPS certificate.    ..read more
Visit website
Vision LLM Structured Output with Sparrow
Andrejus Baranovskis's Blog
by Andrej Baranovskij
1w ago
I show how Sparrow UI Shell works with both image and PDF docs to process and extract structured data with Vision LLM (Qwen2) in the MLX backend.    ..read more
Visit website
Stateless MLX Inference with FastAPI in Sparrow
Andrejus Baranovskis's Blog
by Andrej Baranovskij
1M ago
I show how to run inference with MLX in stateless mode, when loaded model is released after inference completes. This is useful when inference requests are less frequent and it helps to reclaim resources reserved by MLX.   ..read more
Visit website
Streamlined Table Data Extraction with Sparrow | Table Transformer, Qwen2 VL, MLX, & Mac Mini M4 Pro
Andrejus Baranovskis's Blog
by Andrej Baranovskij
1M ago
Learn how to streamline table data extraction with Sparrow, Table Transformer, Qwen2 VL, and MLX on the Mac Mini M4 Pro. Simplify your workflow and get accurate results!    ..read more
Visit website
Structured Output from Multipage PDF with Sparrow (Qwen2 Vision LLM and MLX)
Andrejus Baranovskis's Blog
by Andrej Baranovskij
1M ago
I explain how multipage PDFs are handled in Sparrow to extract structured data in a single call.    ..read more
Visit website
Sparrow Apple MLX Backend on Mac Mini M4 (Qwen2 72B 4bit)
Andrejus Baranovskis's Blog
by Andrej Baranovskij
1M ago
I show how I’m running the Qwen2 72B 4bit model locally on a Mac Mini M4 for Sparrow’s backend. MLX (and MLX-VLM) is the main platform I’m using for local data extraction in Sparrow.    ..read more
Visit website
Batch Inference with Qwen2 Vision LLM (Sparrow)
Andrejus Baranovskis's Blog
by Andrej Baranovskij
2M ago
I'm explaining several hints how to optimize Qwen2 Visual LLM performance for batch processing.    ..read more
Visit website
Visual LLM Structured Output Validation with Sparrow
Andrejus Baranovskis's Blog
by Andrej Baranovskij
2M ago
I explain how Sparrow validates the structured output of visual LLMs to ensure it complies with the JSON schema provided in the query. This process helps prevent errors and hallucinations generated by the LLM.   ..read more
Visit website
Extracting Financial Market Stock Data from Images with Vision LLM
Andrejus Baranovskis's Blog
by Andrej Baranovskij
2M ago
In this video, I demonstrate how to extract financial market stock data from images using the powerful Vision LLM Qwen2, all within a Gradio interface. This setup allows quick and easy extraction of key stock stats from screenshots and other image-based data sources—perfect for analysts, traders, and finance enthusiasts looking to streamline data processing. Watch to see how this AI tool can simplify your workflow and make stock data analysis faster and more efficient!    ..read more
Visit website
Structured Output Example with Sparrow UI Shell
Andrejus Baranovskis's Blog
by Andrej Baranovskij
2M ago
Structured output is all you need. I deployed a Sparrow demo UI with Gradio to demonstrate the output Sparrow can produce by running a JSON schema query. You can see examples for the Bonds table, Lab results, and Bank statement.    ..read more
Visit website

Follow Andrejus Baranovskis's Blog on FeedSpot

Continue with Google
Continue with Apple
OR