Google DeepMind at ICML 2024

Analysis

Revealed: 19 July 2024

Exploring AGI, the challenges of scaling and the way forward for multimodal generative AI

Subsequent week the factitious intelligence (AI) group will come collectively for the 2024 Worldwide Convention on Machine Studying (ICML). Operating from July 21-27 in Vienna, Austria, the convention is a world platform for showcasing the most recent advances, exchanging concepts and shaping the way forward for AI analysis.

This 12 months, groups from throughout Google DeepMind will current greater than 80 analysis papers. At our sales space, we’ll additionally showcase our multimodal on-device mannequin, Gemini Nano, our new household of AI fashions for schooling known as LearnLM and we’ll demo TacticAI, an AI assistant that may assist with soccer techniques.

Right here we introduce a few of our oral, highlight and poster shows:

Defining the trail to AGI

What’s synthetic basic intelligence (AGI)? The phrase describes an AI system that’s a minimum of as succesful as a human at most duties. As AI fashions proceed to advance, defining what AGI might appear to be in follow will turn into more and more vital.

We’ll current a framework for classifying the capabilities and behaviors of AGI fashions. Relying on their efficiency, generality and autonomy, our paper categorizes methods starting from non-AI calculators to rising AI fashions and different novel applied sciences.

We’ll additionally present that open-endedness is essential to constructing generalized AI that goes past human capabilities. Whereas many current AI advances have been pushed by present Web-scale information, open-ended methods can generate new discoveries that reach human data.

At ICML, we’ll be demoing Genie, a mannequin that may generate a variety of playable environments primarily based on textual content prompts, pictures, pictures, or sketches.

Scaling AI methods effectively and responsibly

Creating bigger, extra succesful AI fashions requires extra environment friendly coaching strategies, nearer alignment with human preferences and higher privateness safeguards.

We’ll present how utilizing classification as an alternative of regression strategies makes it simpler to scale deep reinforcement studying methods and obtain state-of-the-art efficiency throughout completely different domains. Moreover, we suggest a novel strategy that predicts the distribution of penalties of a reinforcement studying agent’s actions, serving to quickly consider new situations.

Our researchers current an alignment-maintaining strategy that reduces the necessity for human oversight, and a new strategy to fine-tuning massive language fashions (LLMs), primarily based on recreation idea, higher aligns a LLM’s output with human preferences.

We critique the strategy of coaching fashions on public information and solely fine-tuning with “differentially non-public” coaching, and argue this strategy could not supply the privateness or utility that’s usually claimed it does.

VideoPoet is a big language mannequin for zero-shot video era.

New approaches in generative AI and multimodality

Generative AI applied sciences and multimodal capabilities are increasing the artistic potentialities of digital media.

We’ll current VideoPoet, which makes use of an LLM to generate state-of-the-art video and audio from multimodal inputs together with pictures, textual content, audio and different video.

And share Genie (generative interactive environments), which may generate a variety of playable environments for coaching AI brokers, primarily based on textual content prompts, pictures, pictures, or sketches.

Lastly, we introduce MagicLens, a novel picture retrieval system that makes use of textual content directions to retrieve pictures with richer relations past visible similarity.

Supporting the AI group

We’re proud to sponsor ICML and foster a various group in AI and machine studying by supporting initiatives led by Incapacity in AI, Queer in AI, LatinX in AI and Girls in Machine Studying.

For those who’re on the convention, go to the Google DeepMind and Google Analysis cubicles to fulfill our groups, see stay demos and discover out extra about our analysis.

Study extra

Why Creators Are Craving Unfiltered AI Video Mills

6 New ChatGPT Tasks Options You Have to Know

combining generative AI with live-action filmmaking

Analysis

Revealed: 19 July 2024

Exploring AGI, the challenges of scaling and the way forward for multimodal generative AI

Right here we introduce a few of our oral, highlight and poster shows:

Defining the trail to AGI

At ICML, we’ll be demoing Genie, a mannequin that may generate a variety of playable environments primarily based on textual content prompts, pictures, pictures, or sketches.

Scaling AI methods effectively and responsibly

Creating bigger, extra succesful AI fashions requires extra environment friendly coaching strategies, nearer alignment with human preferences and higher privateness safeguards.

VideoPoet is a big language mannequin for zero-shot video era.

New approaches in generative AI and multimodality

Generative AI applied sciences and multimodal capabilities are increasing the artistic potentialities of digital media.

We’ll current VideoPoet, which makes use of an LLM to generate state-of-the-art video and audio from multimodal inputs together with pictures, textual content, audio and different video.

Lastly, we introduce MagicLens, a novel picture retrieval system that makes use of textual content directions to retrieve pictures with richer relations past visible similarity.

Supporting the AI group

We’re proud to sponsor ICML and foster a various group in AI and machine studying by supporting initiatives led by Incapacity in AI, Queer in AI, LatinX in AI and Girls in Machine Studying.

For those who’re on the convention, go to the Google DeepMind and Google Analysis cubicles to fulfill our groups, see stay demos and discover out extra about our analysis.

Study extra

Google DeepMind at ICML 2024

Why Creators Are Craving Unfiltered AI Video Mills

6 New ChatGPT Tasks Options You Have to Know

combining generative AI with live-action filmmaking

Information to Uber’s H3 for Spatial Indexing

Information Analytics Jobs That Are in Demand – Dataquest

Md Sazzad Hossain

Related Posts

Why Creators Are Craving Unfiltered AI Video Mills

6 New ChatGPT Tasks Options You Have to Know

combining generative AI with live-action filmmaking

Photonic processor may streamline 6G wi-fi sign processing | MIT Information

Construct a Safe AI Code Execution Workflow Utilizing Daytona SDK

Information Analytics Jobs That Are in Demand – Dataquest

Leave a Reply Cancel reply

Recommended

How Sample PXM’s Content material Transient is driving conversion on ecommerce marketplaces utilizing AI

What’s Generative AI and How Is It Being Used? – Dataquest

Categories

CyberDefenseGo

Recent

Addressing Vulnerabilities in Positioning, Navigation and Timing (PNT) Companies

Discord Invite Hyperlink Hijacking Delivers AsyncRAT and Skuld Stealer Concentrating on Crypto Wallets

Search

Welcome Back!

Retrieve your password

Google DeepMind at ICML 2024

Defining the trail to AGI

Scaling AI methods effectively and responsibly

New approaches in generative AI and multimodality

Supporting the AI group

Study extra

You might also like

Defining the trail to AGI

Scaling AI methods effectively and responsibly

New approaches in generative AI and multimodality

Supporting the AI group

Study extra

Information to Uber’s H3 for Spatial Indexing

Information Analytics Jobs That Are in Demand – Dataquest

Related Posts

Leave a Reply Cancel reply

Recommended

Categories

CyberDefenseGo

Recent

Search

Welcome Back!

Retrieve your password