FineTuning

Fine-tuning vs. in-context learning: New research guides better LLM customization for real-world tasks

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Two popular approaches for customizing large language models (LLMs) for downstream tasks are fine-tuning and in-context learning (ICL). In a recent study, researchers at Google DeepMind and Stanford University explored the generalization capabilities of these two…

The TAO of data: How Databricks is optimizing AI LLM fine-tuning without data labels

ellonjohns3 months ago011 mins

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More AI models perform only as well as the data used to train or fine-tune them. Labeled data has been a foundational element of machine learning (ML) and generative AI for much of their history. Labeled data…

This AI Paper Explores Long Chain-of-Thought Reasoning: Enhancing Large Language Models with Reinforcement Learning and Supervised Fine-Tuning

ellonjohns4 months ago08 mins

Large language models (LLMs) have demonstrated proficiency in solving complex problems across mathematics, scientific research, and software engineering. Chain-of-thought (CoT) prompting is pivotal in guiding models through intermediate reasoning steps before reaching conclusions. Reinforcement learning (RL) is another essential component that enables structured reasoning, allowing models to recognize and correct errors efficiently. Despite these advancements,…

Fine-Tuning Llama 3.2 3B Instruct for Python Code: A Comprehensive Guide with Unsloth

ellonjohns4 months ago010 mins

In this tutorial, we’ll walk through how to set up and perform fine-tuning on the Llama 3.2 3B Instruct model using a specially curated Python code dataset. By the end of this guide, you’ll have a better understanding of how to customize large language models for code-related tasks and practical insight into the tools and…

Fine-Tuning an Open-Source LLM with Axolotl Using Direct Preference Optimization (DPO) — SitePoint

ellonjohns6 months ago015 mins

LLMs have unlocked countless new opportunities for AI applications. If you’ve ever wanted to fine-tune your own model, this guide will show you how to do it easily and without writing any code. Using tools like Axolotl and DPO, we’ll walk through the process step by step. What Is an LLM? A Large Language Model…

Highlights

I tried using the Legion Go S handheld as my work PC – here’s what happened next!

Unlocking compound semiconductor manufacturing’s potential requires yield management

Announcing our 2025 VB Transform Innovation Showcase finalists

How to Build an Advanced BrightData Web Scraper with Google Gemini for AI-Powered Data Extraction

Category Collection

Fine-tuning vs. in-context learning: New research guides better LLM customization for real-world tasks

The TAO of data: How Databricks is optimizing AI LLM fine-tuning without data labels

This AI Paper Explores Long Chain-of-Thought Reasoning: Enhancing Large Language Models with Reinforcement Learning and Supervised Fine-Tuning

Fine-Tuning Llama 3.2 3B Instruct for Python Code: A Comprehensive Guide with Unsloth

Fine-Tuning an Open-Source LLM with Axolotl Using Direct Preference Optimization (DPO) — SitePoint