Why tech firms are aiming for smaller, leaner AI models
By Daxia ROJAS
Paris (AFP) Dec 3, 2024

AI firms have long boasted about the enormous size and capabilities of their products, but they are increasingly looking at leaner, smaller models that they say will save on energy and cost.

Programs like ChatGPT are underpinned by algorithms known as "large language models", and the chatbot's creator bragged last year that its GPT-4 model had nearly two trillion "parameters" -- the building blocks of the models.

The vast size of GPT-4 allows ChatGPT to handle queries about anything from astrophysics to zoology.

But if a company needs a program with knowledge only of, say, tigers, the algorithm can be much smaller.

"You don't need to know the terms of the Treaty of Versailles to answer a question about a particular element of engineering," said Laurent Felix of Ekimetrics, a firm that advises companies on AI and sustainability.

Google, Microsoft, Meta and OpenAI have all started offering smaller models.

Amazon, too, offers models of all sizes on its cloud platform.

Kara Hurst, Amazon's chief sustainability officer, said at a recent event in Paris that the trend showed the tech industry was moving towards "sobriety and frugality".

- Energy needs -

Smaller models are better for simple tasks like summarising and indexing documents or searching an internal database.

US pharmaceutical company Merck, for example, is developing a model with Boston Consulting Group (BCG) to understand the impact of certain diseases on genes.

"It will be a very small model, between a few hundred million and a few billion parameters," said Nicolas de Bellefonds, head of AI at BCG.

Laurent Daudet, head of French AI startup LightOn, which specialises in smaller models, said they had several advantages over their larger siblings.

They were often faster and able to "respond to more queries and more users simultaneously", he said.

He also pointed out that they were less energy-hungry -- the potential climate impact being one of the major concerns over AI.

Huge arrays of servers are needed to "train" the AI programs and then to process queries.

These servers -- made up of highly advanced chips -- require vast amounts of electricity both to fuel their operation and to cool them down.

Daudet explained that the smaller models needed far fewer chips, making them cheaper and more energy efficient.

- Multi-model future -

Other proponents point out that smaller models can bypass data centres altogether by being installed directly on devices.

"This is one of the ways to reduce the carbon footprint of our models," Arthur Mensch, head of French start-up Mistral AI, told the Liberation newspaper in October.

Laurent Felix pointed out that direct use on a device also meant more "security and confidentiality of data".

The programs could potentially be trained on proprietary data without fear of it being compromised.

The larger programs, though, still have the edge for solving complex problems and accessing wide ranges of data.

De Bellefonds said the future was likely to involve both kinds of models talking to each other.

"There will be a small model that will understand the question and send this information to several models of different sizes depending on the complexity of the question," he said.

"Otherwise, we will have solutions that are either too expensive, too slow, or both."

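The routing idea De Bellefonds describes can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Model` class, the three model tiers and the word-count heuristic standing in for the small "router" model are hypothetical, and the sketch picks a single model per question rather than several.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Model:
    """Hypothetical stand-in for a deployed language model."""
    name: str
    max_complexity: int  # highest complexity score this model should handle
    answer: Callable[[str], str]


def estimate_complexity(question: str) -> int:
    """Stand-in for the small router model: a crude word-count heuristic."""
    words = len(question.split())
    if words <= 8:
        return 1
    if words <= 25:
        return 2
    return 3


def route(question: str, models: list[Model]) -> Model:
    """Pick the smallest model whose capacity covers the estimated complexity."""
    score = estimate_complexity(question)
    for model in sorted(models, key=lambda m: m.max_complexity):
        if score <= model.max_complexity:
            return model
    # Nothing is rated high enough: fall back to the largest model available.
    return max(models, key=lambda m: m.max_complexity)


# Hypothetical tiers; the lambdas only echo the question back.
MODELS = [
    Model("small", 1, lambda q: f"[small] {q}"),
    Model("medium", 2, lambda q: f"[medium] {q}"),
    Model("large", 3, lambda q: f"[large] {q}"),
]

chosen = route("What do tigers eat?", MODELS)
print(chosen.name, "->", chosen.answer("What do tigers eat?"))
```

In a real system the heuristic would be replaced by the small model itself, and the cost and speed trade-offs he mentions would decide where the thresholds sit.
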

