Category: Supervised
-
Finetuning Llama 3 with Odds Ratio Preference Optimization
Introduction Large Language Models are typically trained in multiple stages rather than built in a single step. These stages, including Supervised Fine-Tuning (SFT) and Preference Alignment, are crucial for teaching the model new capabilities and aligning its outputs with human preferences. However, each stage demands significant time and compute. One solution is Odds Ratio…
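For context, a minimal sketch of what ORPO fine-tuning looks like with Hugging Face TRL's `ORPOTrainer`, which folds the preference-alignment step into a single training run; the checkpoint and dataset names below are illustrative placeholders, not taken from the article:

```python
# A minimal sketch of ORPO fine-tuning with Hugging Face TRL (assumed installed).
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

model_name = "meta-llama/Meta-Llama-3-8B"  # assumed base checkpoint
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# ORPO consumes a preference dataset with "prompt", "chosen", and "rejected"
# columns, so one run covers both SFT and preference alignment.
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")  # example dataset

config = ORPOConfig(
    output_dir="llama3-orpo",
    beta=0.1,                 # weight of the odds-ratio penalty added to the SFT loss
    max_length=1024,
    per_device_train_batch_size=2,
    num_train_epochs=1,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    tokenizer=tokenizer,      # newer TRL versions name this argument processing_class
)
trainer.train()
```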
-
Phi 3 – Small Yet Powerful Models from Microsoft
Introduction The Phi models from Microsoft have been at the forefront of open-source Large Language Models. The Phi architecture has inspired many of the popular small open-source models we see today, including Phixtral, Phi-DPO, and others. The Phi family has taken LLM architecture a step forward with the introduction of Small Language…