Category: language models

Nvidia Introduces VILA: Visual Language Intelligence and Edge AI 2.0

May 6, 2024

—

by

ross

in AI, Applications, Artificial Intelligence, Edge AI 2.0, Guide, images, Intermediate, IOT, language models, LLM, LLMs, Models, NVIDIA, training, VILA, visual model

Introduction Visual Language Models (VLMs) are revolutionizing the way machines comprehend and interact with both images and text. These models skillfully combine techniques from image processing with the subtleties of language comprehension. This integration enhances the capabilities of artificial intelligence (AI). Nvidia and MIT have recently launched a VLM named VILA, enhancing the capabilities of…
Paramanu-Ganita: A New Mathematical Model that Outperforms LLaMa, Falcon, and PaLM

May 5, 2024

—

by

ross

in AI, Artificial Intelligence, Design, efficiency, Gyan AI, Intermediate, language models, Large Language Models, LLM, LLMs, Mathematics, Maths, Models, Paramanu-Ganita, small language models, training

Introduction Large language models (LLMs) have dramatically reshaped computational mathematics. These advanced AI systems, designed to process and mimic human-like text, are now pushing boundaries in mathematical fields. Their ability to understand and manipulate complex concepts has made them invaluable in research and development. Among these innovations stands Paramanu-Ganita, a creation of Gyan AI Research.…
Building Responsible AI with Guardrails AI

May 3, 2024

—

by

ross

in AI, API, Applications, ChatGPT, Github, Guide, Intermediate, language models, Large Language Models, LLMs, Models, Object, Python, validation

Introduction Large Language Models (LLMs) are ubiquitous in various applications such as chat applications, voice assistants, travel agents, and call centers. As new LLMs are released, they improve their response generation. However, people are increasingly using ChatGPT and other LLMs, which may provide prompts with personal identifiable information or toxic language. To protect against these…
Gecko by Google: Pioneering the Next Generation of Text Embedding Models

May 2, 2024

—

by

ross

in AI, Applications, Artificial Intelligence, blockchain, challenges, Embedding, Guide, Intermediate, language models, LLMs, Machine Learning, Models, NLP, Text, training

Introduction Welcome to the world of text embeddings where text is converted into numbers! This world has recently been turned around by the distillation of large language models (LLMs) into efficient and compact forms. Google’s latest innovation, Gecko, is the lastest advancement in this technology, revolutionizing the way we handle textual data. This article explores…
Finetuning Llama 3 with Odds Ratio Preference Optimization

May 2, 2024

—

by

ross

in Advanced, blogathon, dataset, fine tuning, Generative AI, Guide, HuggingFace, Kaggle, language models, Large Language Models, LLM, LLMs, Models, optimization, PyTorch, Supervised, time, training, Unsupervised

Introduction Large Language Models are often trained rather than built, requiring multiple steps to perform well. These steps, including Supervised Fine Tuning (SFT) and Preference Alignment, are crucial for learning new things and aligning with human responses. However, each step takes a significant amount of time and computing resources. One solution is the Odd Ratio…
Phi 3 – Small Yet Powerful Models from Microsoft

May 1, 2024

—

by

ross

in Advanced, Artificial Intelligence, blogathon, language models, Large Language Models, LLM, LLMs, Microsoft, Models, questions, Supervised, tokenizer

Introduction The Phi model from Microsoft has been at the forefront of many open-source Large Language Models. Phi architecture has led to all the popular small open-source models that we see today which include TPhixtral, Phi-DPO, and others. Their Phi Family has taken the LLM architecture a step forward with the introduction of Small Language…
Microsoft Phi 3 Mini: The Tiny Model That Runs on Your Phone

Apr 25, 2024

—

by

ross

in AI, Artificial Intelligence, Generative AI, Intermediate, language models, Large Language Models, LLMs, Microsoft, Models, Phi-3 Mini, SLM, small language models, training, training data

Introduction In the field of artificial intelligence (AI), there’s always been a belief that bigger is better. But Microsoft has just shaken things up with their latest creation, Phi-3-mini. It’s a small AI model that’s turning heads by showing that size isn’t everything. Despite being much smaller than its counterparts, Phi-3-mini can hold its own…

Category: language models

Nvidia Introduces VILA: Visual Language Intelligence and Edge AI 2.0

Paramanu-Ganita: A New Mathematical Model that Outperforms LLaMa, Falcon, and PaLM

Building Responsible AI with Guardrails AI

Gecko by Google: Pioneering the Next Generation of Text Embedding Models

Finetuning Llama 3 with Odds Ratio Preference Optimization

Phi 3 – Small Yet Powerful Models from Microsoft

Microsoft Phi 3 Mini: The Tiny Model That Runs on Your Phone