Vedcraft
610 FOLLOWERS
A place for software architects to learn, grow & share knowledge in architecture, design, code & technology trends like Cloud, DevOps, Microservices.
Vedcraft
3w ago
For the last six years (starting in 2018), AI investors Nathan Benaich and Air Street Capital have consistently published the State of AI Report. This article summarizes key takeaways from the 2024 report published here, along with the author’s interpretation of model cards, industry reports, research reports, and more available publicly. #1 – Frontier lab performance converges, but OpenAI maintains its edge following the launch of o1 (aka Strawberry) When…
The post Key Takeaways from State of AI Report 2024 appeared first on Vedcraft ..read more
Vedcraft
1M ago
An illustrative blog with a reference application applying Snowflake Cortex With Generative AI in mainstream adoption, Snowflake has shared its Generative AI Vision in the past to bring Gen AI and LLMs closer to the customers’ data. Snowflake’s Cortex AI is their fully managed service (GA on May 24) to manage LLMs and the entire lifecycle for diverse business and technical users: This article shares the key architecture and design approaches and related components…
The post Building Gen AI Apps with Snowflake Cortex as a Foundational AI Platform appeared first on Vedcraft ..read more
Vedcraft
6M ago
Observing technology trends by analysts, research companies, and thought leaders provides a broader perspective. It not only helps software architects to understand the impact of technologies being adopted but also helps them build the right skill set for themselves or their team.
This article synthesizes the top ten technology trends for 2024 and beyond based on my broader research, technology insights, observations, and industry experience.
#1 — Generative AI and AI Platforms
Generative AI appears as a key technology trends by Gartner Top Ten Strategic Technology Trend ..read more
Vedcraft
6M ago
Over the years in-memory computing has evolved and played a significant role in modern architecture primarily as part of Data Platform solutions. In-memory data grid and in-memory database/datastore technology are part of the data platform ecosystem. Click here to read the previous article on the importance of caching and in-memory data grid technology.
Context
Before we deep-dive into the decision tree to make the selection, here is the definition of both the terminologies:
In-memory database/datastore: A database technology relying on memory (RAM) for data storage and retrieval instead of d ..read more
Vedcraft
10M ago
#1 – Google’s launch of Gemini & Gemini 1.5
Google launched Gemini 1.5 on Feb 15, 2024, which transcends all other models with a 128K token window with the capability to support up to 1 million tokens per context window.
Gemini 1.5 use-cases: Research over a very large amount of data processing by LLM including 1 hour of video, 11 hours of audio, codebases with over 30,000 lines of code or over 700,000 words
Google launched Gemini models (Ultra, Pro, and Nano) in Dec 2023. Gemini Ultra outperformed MMLU (massive multitask language understanding), which uses 50+ subject areas.
Google Gemin ..read more
Vedcraft
10M ago
Table of Contents
[Open][Close]
Introduction
Open Source LLMs
#1 – Llama 2
#2 – Falcon
#3 – BLOOM
#4 – MPT-30B and MPT-7B
#5 – Mistral 7B
Conclusion
References
Introduction
Large Language Models (LLMs) do not need any introduction with the rise of its massive adoption in the industry. Most enterprises have either adopted or planning to adopt an LLM to build Generative AI-based enterprise applications supporting a variety of business use cases.
While there are many closed-source options available such as – OpenAI’s GPT-3.5 or GPT-4, Google’s Gemini, etc., Open source LLMs have start ..read more
Vedcraft
1y ago
Since its inception in 2015, the CNCF community has evolved as a global community supporting open source, and vendor-neutral cloud-native technologies. The North American KubeCon + CloudNativeCon 2023 event in Chicago (Nov. 6 to 9) consisted of 80+ different sessions across 10+ tracks covering the broader spectrum of the Cloud-native landscape.
While I have attended KubeCon multiple times, this has been my first in-person attendance. This article shares key takeaways based on my observations based on sessions I attended at KubeCon + CloudNativeCon.
#1 — It is the place for ..read more
Vedcraft
1y ago
Amazon Bedrock, which was announced in April 2023, has drawn a lot of attention from businesses looking to leverage their existing AWS architecture for building Generative AI applications. Amazon announced the General Availability (GA) of Bedrock on September 28 — a service that offers a choice of Generative AI models from Amazon and third-party providers through an API-based interface.
With the current market landscape (high demand for leveraging GAI for building innovative business capabilities) and increasing competitive offerings (Azure’s OpenAI partnership & ..read more
Vedcraft
1y ago
Meta released Code Llama, a large language model (LLM) that can use text prompts to generate and discuss code, on August 24, 2023. It has been built on Llama 2 as a foundational model and is free for research and commercial use. Click here to read the news annoucment published by Meta.
The below visualization depicts the foundational building block of Llama 2, and an approach to build your own custom model on top of it:
Key deliverables/artifacts, which can be accessed by following links below:
GitHub Repository for Code LLama
Request Access to Download Code LLama
Research Paper on Code Lla ..read more
Vedcraft
1y ago
HashiCorp announced on August 10, 2023, that they are adopting Business Source License from Mozilla Public License v2.0 (MPL 2.0). Click here to read the announcement and they have also published a FAQ page.
This will require additional consideration if you are using their key product offerings – Terraform, Packer, Vault, Boundary, Consul, Nomad, Waypoint, and Vagrant.
Source: https://www.hashicorp.com/blog/categories/products-technology
This has raised many questions in the Open source community, particularly because of the limitations as part of the BSL license:
BSL is an alternative ..read more