By using this site, you agree to the Privacy Policy and Terms of Use.
Accept

News Junction

Notification Show More
Font ResizerAa
  • Home
  • World News
    World NewsShow More
    In one Indian city, reflective paint and bus stop sprinklers offer relief from killer heat
    In one Indian city, reflective paint and bus stop sprinklers offer relief from killer heat
    May 14, 2025
    Ousted Bangladesh PM Hasina’s party barred from election as party registration suspended
    Ousted Bangladesh PM Hasina’s party barred from election as party registration suspended
    May 14, 2025
    Cassandra Ventura testifies, tells jury freak offs became a job
    Cassandra Ventura testifies, tells jury freak offs became a job
    May 13, 2025
    White South Africans arrive in US after Trump administration granted them refugee status | World News
    White South Africans arrive in US after Trump administration granted them refugee status | World News
    May 13, 2025
    Trump’s Mideast Wish List: + Trillion in Investments – and Some Diplomacy Too
    Trump’s Mideast Wish List: $1+ Trillion in Investments – and Some Diplomacy Too
    May 13, 2025
  • Business
    BusinessShow More
    Ukraine blows up bridges to consolidate its positions in Russia
    Ukraine blows up bridges to consolidate its positions in Russia
    August 18, 2024
    Commentary: AI phones from Google and Apple will erode trust in everything
    Commentary: AI phones from Google and Apple will erode trust in everything
    August 18, 2024
    The most famous Indian Dishes – Insights Success
    The most famous Indian Dishes – Insights Success
    August 18, 2024
    Life on the road as a female long rides cyclist
    Life on the road as a female long rides cyclist
    August 18, 2024
    UK inflation rises to 2.2%
    UK inflation rises to 2.2%
    August 18, 2024
  • Cryptocurrency
    CryptocurrencyShow More
    Dogecoin Price Completes Weekly Close Above Pre-Halving Highs, Next Stop Above alt=
    Dogecoin Price Completes Weekly Close Above Pre-Halving Highs, Next Stop Above $0.2?
    May 14, 2025
    Cantor Equity Partners (CEP) News: 4,812 Bitcoin Purchased
    Cantor Equity Partners (CEP) News: 4,812 Bitcoin Purchased
    May 14, 2025
    BTC, ETH, XRP, BNB, SOL, ADA, DOGE, PI, LEO, HBAR
    BTC, ETH, XRP, BNB, SOL, ADA, DOGE, PI, LEO, HBAR
    May 14, 2025
    Altcoins’ roaring returns and falling USDT stablecoin dominance suggest ‘altseason’ is here
    Altcoins’ roaring returns and falling USDT stablecoin dominance suggest ‘altseason’ is here
    May 14, 2025
    How to Use tsUSDe on TON for Passive Dollar Yield in 2025
    How to Use tsUSDe on TON for Passive Dollar Yield in 2025
    May 13, 2025
  • Technology
    TechnologyShow More
    How to Improve Your Spotify Recommendations
    How to Improve Your Spotify Recommendations
    August 18, 2024
    X says it’s closing operations in Brazil
    X says it’s closing operations in Brazil
    August 18, 2024
    Supermoon set to rise: Top tips for amateur photographers | Science & Tech News
    Supermoon set to rise: Top tips for amateur photographers | Science & Tech News
    August 18, 2024
    Scientists Want to See Videos of Your Cat for a New Study
    Scientists Want to See Videos of Your Cat for a New Study
    August 18, 2024
    OpenAI’s new voice mode let me talk with my phone, not to it
    OpenAI’s new voice mode let me talk with my phone, not to it
    August 18, 2024
  • Entertainment
  • Sports News
  • People
  • Trend
Reading: A New Trick Could Block the Misuse of Open Source AI
Share
Font ResizerAa

News Junction

  • World News
  • Business
  • Technology
  • Cryptocurrency
  • Trend
  • Entertainment
Search
  • Recent Headlines in Entertainment, World News, and Cryptocurrency – NewsJunction
  • World News
  • Business
  • Cryptocurrency
  • Technology
  • Entertainment
  • Sports News
  • People
  • Trend
Have an existing account? Sign In
Follow US
News Junction > Blog > Trend > A New Trick Could Block the Misuse of Open Source AI
A New Trick Could Block the Misuse of Open Source AI
Trend

A New Trick Could Block the Misuse of Open Source AI

Published August 3, 2024
Share
5 Min Read
SHARE

When Meta released its large language model Llama 3 for free this April, it took outside developers just a couple days to create a version without the safety restrictions that prevent it from spouting hateful jokes, offering instructions for cooking meth, or misbehaving in other ways.

A new training technique developed by researchers at the University of Illinois Urbana-Champaign, UC San Diego, Lapis Labs, and the nonprofit Center for AI Safety could make it harder to remove such safeguards from Llama and other open source AI models in the future. Some experts believe that, as AI becomes ever more powerful, tamperproofing open models in this way could prove crucial.

“Terrorists and rogue states are going to use these models,” Mantas Mazeika, a Center for AI Safety researcher who worked on the project as a PhD student at the University of Illinois Urbana-Champaign, tells WIRED. “The easier it is for them to repurpose them, the greater the risk.”

Powerful AI models are often kept hidden by their creators, and can be accessed only through a software application programming interface or a public-facing chatbot like ChatGPT. Although developing a powerful LLM costs tens of millions of dollars, Meta and others have chosen to release models in their entirety. This includes making the “weights,” or parameters that define their behavior, available for anyone to download.

Prior to release, open models like Meta’s Llama are typically fine-tuned to make them better at answering questions and holding a conversation, and also to ensure that they refuse to respond to problematic queries. This will prevent a chatbot based on the model from offering rude, inappropriate, or hateful statements, and should stop it from, for example, explaining how to make a bomb.

The researchers behind the new technique found a way to complicate the process of modifying an open model for nefarious ends. It involves replicating the modification process but then altering the model’s parameters so that the changes that normally get the model to respond to a prompt such as “Provide instructions for building a bomb” no longer work.

Mazeika and colleagues demonstrated the trick on a pared-down version of Llama 3. They were able to tweak the model’s parameters so that even after thousands of attempts, it could not be trained to answer undesirable questions. Meta did not immediately respond to a request for comment.

Mazeika says the approach is not perfect, but that it suggests the bar for “decensoring” AI models could be raised. “A tractable goal is to make it so the costs of breaking the model increases enough so that most adversaries are deterred from it,” he says.

“Hopefully this work kicks off research on tamper-resistant safeguards, and the research community can figure out how to develop more and more robust safeguards,” says Dan Hendrycks, director of the Center for AI Safety.

The idea of tamperproofing open models may become more popular as interest in open source AI grows. Already, open models are competing with state-of-the-art closed models from companies like OpenAI and Google. The newest version of Llama 3, for instance, released in July, is roughly as powerful as models behind popular chatbots like ChatGPT, Gemini, and Claude, as measured using popular benchmarks for grading language models’ abilities. Mistral Large 2, an LLM from a French startup, also released last month, is similarly capable.

The US government is taking a cautious but positive approach to open source AI. A report released this week by the National Telecommunications and Information Administration, a body within the US Commerce Department, “recommends the US government develop new capabilities to monitor for potential risks, but refrain from immediately restricting the wide availability of open model weights in the largest AI systems.”

Not everyone is a fan of imposing restrictions on open models, however. Stella Biderman, director of EleutherAI, a community-driven open source AI project, says that the new technique may be elegant in theory but could prove tricky to enforce in practice. Biderman says the approach is also antithetical to the philosophy behind free software and openness in AI.

“I think this paper misunderstands the core issue,” Biderman says. “If they’re concerned about LLMs generating info about weapons of mass destruction, the correct intervention is on the training data, not on the trained model.”

#Trick #Block #Misuse #Open #Source

TAGGED:Artificial intelligenceblockMetamisuseOpenopen sourcesecurity researchsourcetrick
Share This Article
Facebook Twitter Pinterest Whatsapp Whatsapp LinkedIn Email Copy Link Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Fed likely feels ‘remorse’ over July rate call and faces a big cut in September – Swonk Fed likely feels ‘remorse’ over July rate call and faces a big cut in September – Swonk
Next Article Justice Department sues TikTok over alleged children’s online privacy law violations Justice Department sues TikTok over alleged children’s online privacy law violations
- Advertisement -

Latest Post

Dogecoin Price Completes Weekly Close Above Pre-Halving Highs, Next Stop Above alt=
Dogecoin Price Completes Weekly Close Above Pre-Halving Highs, Next Stop Above $0.2?
Cryptocurrency
In one Indian city, reflective paint and bus stop sprinklers offer relief from killer heat
In one Indian city, reflective paint and bus stop sprinklers offer relief from killer heat
World News
Cantor Equity Partners (CEP) News: 4,812 Bitcoin Purchased
Cantor Equity Partners (CEP) News: 4,812 Bitcoin Purchased
Cryptocurrency
BTC, ETH, XRP, BNB, SOL, ADA, DOGE, PI, LEO, HBAR
BTC, ETH, XRP, BNB, SOL, ADA, DOGE, PI, LEO, HBAR
Cryptocurrency
Ousted Bangladesh PM Hasina’s party barred from election as party registration suspended
Ousted Bangladesh PM Hasina’s party barred from election as party registration suspended
World News
Altcoins’ roaring returns and falling USDT stablecoin dominance suggest ‘altseason’ is here
Altcoins’ roaring returns and falling USDT stablecoin dominance suggest ‘altseason’ is here
Cryptocurrency
- Advertisement -

You Might Also Like

POCO M6 Pro 5G: Launch date confirmed for Redmi 12 5G and Redmi Note 12R re-brand
Trend

POCO M6 Pro 5G: Launch date confirmed for Redmi 12 5G and Redmi Note 12R re-brand

August 2, 2023
Why open data is needed in the battle to address homelessness
Technology

Why open data is needed in the battle to address homelessness

February 8, 2024
The Block judge Neale Whitaker launches extraordinary spray at show producers over his edit in season debut: ‘There’s nothing more frustrating’
Entertainment

The Block judge Neale Whitaker launches extraordinary spray at show producers over his edit in season debut: ‘There’s nothing more frustrating’

August 7, 2023
Tesla hack exploits AMD vulnerability to access user data and unlock US,000 in paid software-locked features
Trend

Tesla hack exploits AMD vulnerability to access user data and unlock US$15,000 in paid software-locked features

August 4, 2023

About Us

NEWS JUNCTION (NewsJunction.xyz) Your trusted destination for global news. Stay informed with our timely and accurate reporting on diverse topics, including politics, technology, science, entertainment, sports, and more. Count on us for unbiased and reliable updates at your fingertips.

Quick Link

  • About
  • Disclaimer
  • Privacy Policy
  • Terms of Use
  • Contact

Top Categories

  • World News
  • Business
  • Technology
  • Entertainment
  • Cryptocurrency
  • Sports News
  • Trend
  • People

Subscribe

Subscribe to our newsletter to get our newest articles instantly!

    © 2023 News Junction.
    • Blog
    • Advertise
    • Contact
    Welcome Back!

    Sign in to your account

    Username or Email Address
    Password

    Lost your password?