Thursday, 26 Jun 2025
Subscribe
logo logo
  • Global
  • Technology
  • Business
  • AI
  • Cloud
  • Edge Computing
  • Security
  • Investment
  • More
    • Sustainability
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
  • 🔥
  • data
  • Secures
  • Funding
  • revolutionizing
  • Investment
  • Series
  • Center
  • cloud
  • Future
  • million
  • Power
  • Growth
Font ResizerAa
Silicon FlashSilicon Flash
Search
  • Global
  • Technology
  • Business
  • AI
  • Cloud
  • Edge Computing
  • Security
  • Investment
  • More
    • Sustainability
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Silicon Flash > Blog > AI > Enhancing LLM Customization for Real-World Tasks: The Impact of Fine-Tuning and In-Context Learning
AI

Enhancing LLM Customization for Real-World Tasks: The Impact of Fine-Tuning and In-Context Learning

Published May 10, 2025 By Juwan Chacko
Share
4 Min Read
Enhancing LLM Customization for Real-World Tasks: The Impact of Fine-Tuning and In-Context Learning
SHARE



Stay up to date with the latest developments and exclusive content in the field of artificial intelligence by subscribing to our daily and weekly newsletters. Find out more









When it comes to customizing large language models (LLMs) for specific tasks, two common approaches are fine-tuning and in-context learning (ICL). A recent study conducted by researchers from Google DeepMind and Stanford University delved into the generalization capabilities of these methods. The study revealed that ICL demonstrates superior generalization abilities, although it does require higher computation costs during inference. Additionally, the researchers proposed a novel approach to combine the strengths of both methods.



These findings have significant implications for developers looking to build LLM applications tailored to their enterprise data.



Exploring How Language Models Adapt to New Tasks



Fine-tuning involves further training a pre-trained LLM on a specialized dataset to impart new knowledge or skills. In contrast, ICL does not alter the model’s internal parameters but provides examples of the desired task directly within the input prompt to guide the LLM. The model then learns how to handle similar queries based on these examples.



The researchers conducted a rigorous comparison of how well models generalize to new tasks using these two methods. They created synthetic datasets with intricate, self-consistent structures, such as imaginary family trees or hierarchies of fictional concepts, to test the model’s ability to learn new information. To ensure unbiased testing, all nouns, adjectives, and verbs were replaced with nonsensical terms that the LLMs had not encountered during pre-training.



The models were subjected to various generalization challenges, including simple reversals and syllogisms, as well as a more complex semantic structure benchmark. The results highlighted the effectiveness of ICL in promoting better generalization in data-matched settings compared to standard fine-tuning.

See also  Understanding Software Development Models Through Real-World Examples


A Hybrid Approach: Enhancing Fine-Tuning



Building on the superior generalization capabilities of ICL, the researchers introduced a new method to enhance fine-tuning by incorporating in-context inferences into the training data. This approach leverages the LLM’s own ICL abilities to generate diverse examples, which are then added to the fine-tuning dataset.



Two main data augmentation strategies were explored:




  1. A local strategy focused on rephrasing individual sentences or drawing inferences from them.

  2. A global strategy involved providing the full training dataset as context to generate longer reasoning traces of relevant inferences.



When the models were fine-tuned on these augmented datasets, significant improvements in generalization were observed. Augmented fine-tuning not only outperformed standard fine-tuning but also surpassed plain ICL in terms of performance.







This innovative approach presents a promising avenue for enterprises seeking to enhance the generalization capabilities of their fine-tuned models. By incorporating ICL-augmented datasets, developers can create more robust LLM applications that perform effectively across diverse real-world inputs without incurring continuous inference-time costs associated with large in-context prompts.



While augmented fine-tuning may increase the overall training costs, the improved generalization benefits outweigh the expenses, making it a cost-effective solution in the long run. Developers are encouraged to explore augmented fine-tuning in cases where standard fine-tuning alone falls short.



Ultimately, this research contributes to advancing the understanding of learning and generalization in foundation models, offering practical insights for adapting them to various downstream tasks.


TAGGED: Customization, Enhancing, FineTuning, Impact, InContext, Learning, LLM, RealWorld, Tasks
Share This Article
Facebook LinkedIn Email Copy Link Print
Previous Article Toloka Secures  Million in Funding Round Toloka Secures $72 Million in Funding Round
Next Article Unlimited Documentaries: One Subscription, Lifetime Access for 9.97 Unlimited Documentaries: One Subscription, Lifetime Access for $149.97
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
LinkedInFollow

Popular Posts

India’s Quantum Leap: Pioneering the Future of Computing

India has set its sights on the future of quantum computing with the launch of…

May 6, 2025

Microsoft, Western Digital Recycle Drives to Recover Rare Earth Metals

In a groundbreaking collaboration, Microsoft and Western Digital have joined forces on an innovative recycling…

April 23, 2025

Navigating Uncertainty: Insights from the 2025 GeekWire Awards

The 2025 GeekWire Awards in Seattle was a groundbreaking event that showcased the brightest minds…

May 3, 2025

Qualcomm and E& Forge Partner to Empower UAE’s Digital Infrastructure with Edge AI Technology

Summary: Qualcomm and e& Forge have partnered to drive digital transformation in the UAE through…

June 2, 2025

Uncovering the State of Innovation: The Critical Need for Balanced R&D Investment

Summary: 1. Washington's patent filings have declined significantly despite strong investment in research and development.…

May 22, 2025

You Might Also Like

Identity as the Foundation: Safeguarding Enterprise AI Security
AI

Identity as the Foundation: Safeguarding Enterprise AI Security

Juwan Chacko
Maximizing AI Potential: IBM’s Approach to Matching LLM with Enterprise Use Cases
AI

Maximizing AI Potential: IBM’s Approach to Matching LLM with Enterprise Use Cases

Juwan Chacko
Guarding Against the Dangers of AI Deepfakes: Safeguarding Internet Freedom
AI

Guarding Against the Dangers of AI Deepfakes: Safeguarding Internet Freedom

Juwan Chacko
The Impact of Public Perception on the UK’s AI Revolution
Global Market

The Impact of Public Perception on the UK’s AI Revolution

Juwan Chacko
logo logo
Facebook Linkedin Rss

About US

Silicon Flash: Stay informed with the latest Tech News, Innovations, Gadgets, AI, Data Center, and Industry trends from around the world—all in one place.

Top Categories
  • Technology
  • Business
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2025 – siliconflash.com – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?