Tuesday, 16 Sep 2025
Subscribe
logo logo
  • Global
  • Technology
  • Business
  • AI
  • Cloud
  • Edge Computing
  • Security
  • Investment
  • More
    • Sustainability
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
  • 🔥
  • data
  • Secures
  • revolutionizing
  • Funding
  • Investment
  • Future
  • Growth
  • Center
  • technology
  • Series
  • cloud
  • Power
Font ResizerAa
Silicon FlashSilicon Flash
Search
  • Global
  • Technology
  • Business
  • AI
  • Cloud
  • Edge Computing
  • Security
  • Investment
  • More
    • Sustainability
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Silicon Flash > Blog > AI > Enhancing LLM Customization for Real-World Tasks: The Impact of Fine-Tuning and In-Context Learning
AI

Enhancing LLM Customization for Real-World Tasks: The Impact of Fine-Tuning and In-Context Learning

Published May 10, 2025 By Juwan Chacko
Share
4 Min Read
Enhancing LLM Customization for Real-World Tasks: The Impact of Fine-Tuning and In-Context Learning
SHARE



Stay up to date with the latest developments and exclusive content in the field of artificial intelligence by subscribing to our daily and weekly newsletters. Find out more









When it comes to customizing large language models (LLMs) for specific tasks, two common approaches are fine-tuning and in-context learning (ICL). A recent study conducted by researchers from Google DeepMind and Stanford University delved into the generalization capabilities of these methods. The study revealed that ICL demonstrates superior generalization abilities, although it does require higher computation costs during inference. Additionally, the researchers proposed a novel approach to combine the strengths of both methods.



These findings have significant implications for developers looking to build LLM applications tailored to their enterprise data.



Exploring How Language Models Adapt to New Tasks



Fine-tuning involves further training a pre-trained LLM on a specialized dataset to impart new knowledge or skills. In contrast, ICL does not alter the model’s internal parameters but provides examples of the desired task directly within the input prompt to guide the LLM. The model then learns how to handle similar queries based on these examples.



The researchers conducted a rigorous comparison of how well models generalize to new tasks using these two methods. They created synthetic datasets with intricate, self-consistent structures, such as imaginary family trees or hierarchies of fictional concepts, to test the model’s ability to learn new information. To ensure unbiased testing, all nouns, adjectives, and verbs were replaced with nonsensical terms that the LLMs had not encountered during pre-training.



The models were subjected to various generalization challenges, including simple reversals and syllogisms, as well as a more complex semantic structure benchmark. The results highlighted the effectiveness of ICL in promoting better generalization in data-matched settings compared to standard fine-tuning.

See also  Robotics Revolution: Meta's Innovative World Model Empowers Robots to Master Unfamiliar Environments


A Hybrid Approach: Enhancing Fine-Tuning



Building on the superior generalization capabilities of ICL, the researchers introduced a new method to enhance fine-tuning by incorporating in-context inferences into the training data. This approach leverages the LLM’s own ICL abilities to generate diverse examples, which are then added to the fine-tuning dataset.



Two main data augmentation strategies were explored:




  1. A local strategy focused on rephrasing individual sentences or drawing inferences from them.

  2. A global strategy involved providing the full training dataset as context to generate longer reasoning traces of relevant inferences.



When the models were fine-tuned on these augmented datasets, significant improvements in generalization were observed. Augmented fine-tuning not only outperformed standard fine-tuning but also surpassed plain ICL in terms of performance.







This innovative approach presents a promising avenue for enterprises seeking to enhance the generalization capabilities of their fine-tuned models. By incorporating ICL-augmented datasets, developers can create more robust LLM applications that perform effectively across diverse real-world inputs without incurring continuous inference-time costs associated with large in-context prompts.



While augmented fine-tuning may increase the overall training costs, the improved generalization benefits outweigh the expenses, making it a cost-effective solution in the long run. Developers are encouraged to explore augmented fine-tuning in cases where standard fine-tuning alone falls short.



Ultimately, this research contributes to advancing the understanding of learning and generalization in foundation models, offering practical insights for adapting them to various downstream tasks.


TAGGED: Customization, Enhancing, FineTuning, Impact, InContext, Learning, LLM, RealWorld, Tasks
Share This Article
Facebook LinkedIn Email Copy Link Print
Previous Article Toloka Secures  Million in Funding Round Toloka Secures $72 Million in Funding Round
Next Article Unlimited Documentaries: One Subscription, Lifetime Access for 9.97 Unlimited Documentaries: One Subscription, Lifetime Access for $149.97
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
LinkedInFollow

Popular Posts

BWXT Strengthens Portfolio with Kinectrics Acquisition

Summary: BWX Technologies, Inc. has acquired Kinectrics, a Toronto-based provider of nuclear power plant lifecycle…

May 25, 2025

Landmark data centre deal | Data Centre Solutions

Indonesian Banks Provide Debt to BDx Indonesia for Growth Three prominent Indonesian banks - BCA,…

April 20, 2025

Aggreko Bolsters Energy Solutions Portfolio with New Renewable Developments

Aggreko Delivers Renewable Generation Solutions for Businesses in Europe Aggreko, under the leadership of Jon…

May 2, 2025

When AI reasoning goes wrong: Microsoft Research shows more tokens can mean more problems

The Complex Nature of Inference-Time Scaling: Insights from a Microsoft Research Study Join our daily…

April 21, 2025

The Seattle Startup Scene: A Haven for Founders

Seattle is a thriving hub for startups, offering unique advantages for founders looking to build…

July 31, 2025

You Might Also Like

Trailblazing Teen: Owen Cooper’s Impact on Warrington
Technology

Trailblazing Teen: Owen Cooper’s Impact on Warrington

SiliconFlash Staff
Enhancing the Google Pixel Phone Home Screen: 4 Innovative Ideas
Technology

Enhancing the Google Pixel Phone Home Screen: 4 Innovative Ideas

SiliconFlash Staff
Navigating the Waves: A Sea Pilot’s Trial with Radar-Informed AI
AI

Navigating the Waves: A Sea Pilot’s Trial with Radar-Informed AI

Juwan Chacko
AI’s Big Companies Left Behind: The Impact of Selling Coffee Beans to Starbucks
Business

AI’s Big Companies Left Behind: The Impact of Selling Coffee Beans to Starbucks

Juwan Chacko
logo logo
Facebook Linkedin Rss

About US

Silicon Flash: Stay informed with the latest Tech News, Innovations, Gadgets, AI, Data Center, and Industry trends from around the world—all in one place.

Top Categories
  • Technology
  • Business
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2025 – siliconflash.com – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?