Google Developers Experts on Feedspot

[ML Story] Part 3: Deploy Gemma on Android

Google Developers Experts

by Nitin Tiwari

13h ago

Written in collaboration with AI/ML GDE Aashi Dutt. Introduction In the preceding two articles, we successfully learned how to prepare your own dataset and fine-tune the Gemma model using supervised fine-tuning with LoRA. Just as a journey needs companions, an ML project needs its model deployed. It’s like peanut butter without jelly — incomplete. In the final part of this series, we’ll walk you through deploying the fine-tuned Gemma model on Android, enabling you to have a complete end-to-end application at your fingertips. Before we begin, ensure that you have Android Studio instal ..read more

Visit website

Enhancing Web Image Accessibility for Visually Impaired Individuals with Gemini Pro Vision and…

Google Developers Experts

by Cyrus Wong

13h ago

Enhancing Web Image Accessibility for Visually Impaired Individuals with Gemini Pro Vision and Google Cloud Platform Problem The inability of visually impaired individuals access image information due to the lack of adherence to W3C web accessibility initiatives by websites. Currently, about 60% of websites lack meaningful alternate text for their images. Moreover, it is unfeasible to retroactively add descriptive text to all existing websites manually. Two mins short Project Introduction and our story https://medium.com/media/350efbd62c0a343dc4dec2353120f5a9/href Demo 1 https://medi ..read more

Visit website

Kotlin Coroutine mechanisms: runBlocking v. launch

Google Developers Experts

by mvndy

14h ago

Introduction to coroutine behavior through playful examples Sometimes you think you know coroutines and then after a while, you’re like “wait, do I really know coroutines?” This series serves as spin off from Programming Android with Kotlin: Achieving Structured Concurrency with Coroutines intended to help strengthen everyday coroutine understanding through playful explorations. We [the authors] always had sincere intentions with the book: While [coroutine] concepts are important if you want to master coroutines, you don’t have to understand everything right now to get started and be ..read more

Visit website

AI Image Classification for PEZ Collectors | Vertex AI & MediaPipe on Android

Google Developers Experts

by Mike Wolfson

14h ago

I’ve always had a soft spot for PEZ dispensers and have been collecting them for over 30 years. These quirky little collectibles come in an incredible variety of shapes and characters. But there’s more to PEZ than just fun; some dispensers can be quite valuable depending on their age and variation. To address the challenge of identification, I harnessed the power of AI to create an image classification model that could help identify these subtle differences. Use Case: Identifying PEZ dispensers Identifying the precise type of PEZ dispenser isn’t always easy. Take Mickey Mouse dispensers, for e ..read more

Visit website

[Mar 2024] ML Community — Highlights and Achievements

Google Developers Experts

by Nari Yoon

4d ago

[Mar 2024] ML Community — Highlights and Achievements Let’s explore highlights and accomplishments of the vast Google Machine Learning communities over the month. We appreciate all the activities and commitment by the community members. Without further ado, here are the key highlights! Featured Stories ML Developer’s Journey AI/ML GDE Rubens Zimbres (Brazil) shared how he increased the number of readers and followers on his Medium channel. From Mar 2023 to Mar 2024 (1 year), readers have increased by 700% and followers increased by 800%! One of his articles, Augmenting Gemini-1.0-Pro with Kno ..read more

Visit website

AI Chef: Turning Food Photos into Recipes with Gemini Vision Pro in Colab

Google Developers Experts

by Esther Irawati Setiawan

5d ago

Have you ever stared at a photo of a delicious dish and wondered what it was or how to make it? With the power of AI and image recognition, that question can now be a thing of the past. In this article, we’ll explore how to leverage Gemini Vision Pro, a large language model from Google AI, alongside Colaboratory (Colab), a free Jupyter Notebook environment in the cloud, to generate recipes based on an image. What is Gemini Vision Pro? Gemini Vision Pro is a cutting-edge vision model from Google AI capable of understanding and interpreting visual content. It can analyze images and ext ..read more

Visit website

Using Gemini 1.5 Pro to create video trailers

Google Developers Experts

by Dimitre Oliveira

5d ago

Taking advantage of the Gemini's multi-modal input to create trailers for any videos. This year on February 15, Google announced the release of Gemini 1.5, this new version brought many improvements, and on top of impressive improvements in the language domain, this model can process a huge input context of up to 1 million tokens, to make it even better it is was trained as a multimodal model, this means that is can natively process text, images, audio or video. This combination of different input types and huge context got me excited with the opportunity to process long videos, so I rev ..read more

Visit website

[ML Story]Multi-modal LLMs made easy: photo & video reasoning with Gemini 1.5 Pro

Google Developers Experts

by Gabriel Moreira

1w ago

Screenshot of Google AI Studio with Gemini 1.5 Pro model selected, and my multi-modal prompt with my L.A. trip photos How accelerated has been the evolution of Generative AI technologies! People are impressed by Multi-modal LLMs, that can understand and generate text, images, videos and audio using a single end-to-end model. In this post, I demonstrate how to use a great recent multimodal LLM — Gemini 1.5 Pro — for the se case of generating a blog post solely from photos and videos taken on a trip. In the end, I also talk briefly about some popular multi-modal LLM architectures and public ..read more

Visit website

Leveraging Gemini 1.5 Multimodal model(Generative AI) for Software development

Google Developers Experts

by Monika Kumar Jethani

1w ago

Image Source: https://dataedo.com/asset/img/banners/blog/cartoons.png Google recently launched Gemini 1.5 Pro model, which is a mid-sized multimodal model optimised for scaling across wide range of tasks. In this blog, we will learn how Gemini 1.5 Pro model can help us during software development. This blog is an improved and recent version of my previous blog, How Generative AI improves the productivity of Software developers All examples in this blog use the freeform prompt in Google AI Studio and Gemini 1.5 pro model. Below are some of the ways in which Gemini 1.5 Pro can help sof ..read more

Visit website

Fine Tuning Gemma-2b to Solve Math Problems

Google Developers Experts

by Rubens Zimbres

1w ago

Mathematical word problem-solving has long been recognized as a complex task for small language models (SLMs). To reach a good level of performance with these models, researchers often train SLMs to generate Python code or by using ensembling techniques, associated with consensus or majority vote. The challenge here is to use Google’s Gemma model, with less than 2 billion parameters and with safeguards against generating code to solve these Grade School Math problems. Here I will use Microsoft’s Orca-Math dataset, a high quality synthetic dataset of 200K math problems obtained through a multi ..read more

Visit website

Follow Google Developers Experts on FeedSpot