Saudi Press

Saudi Arabia and the world
Saturday, Feb 22, 2025

Google’s SummAE AI generates abstract summaries of paragraphs

Google’s SummAE AI generates abstract summaries of paragraphs

Google researchers propose a novel AI summarization model - SummAE- capable of generating abstract summaries of paragraphs.
Machines have a tougher time summarizing text than you’d think, at least where the summarization is abstractive rather than extractive. While the extraction requires merely concatenating sentences, abstraction involves the task of paraphrasing using novel sentences. Progress has been made in the news domain recently, perhaps owing to the abundance of corpora on which algorithmic systems can be trained. But robust summarization of most other writing forms remains an unsolved problem.

Motivated by this, a team at Google Brain investigated an abstractive summarization system dubbed SummAE that’s largely unsupervised, meaning it’s able to generalize from a small amount of training data to unseen textual examples. While it couldn’t summarize beyond single five-sentence paragraphs, the researchers claim it “significantly” improves upon the baseline and represents a “major” step in the direction of human-level performance.


Machines have a tougher time summarizing text than you’d think, at least where the summarization is abstractive rather than extractive. While the extraction requires merely concatenating sentences, abstraction involves the task of paraphrasing using novel sentences. Progress has been made in the news domain recently, perhaps owing to the abundance of corpora on which algorithmic systems can be trained. But robust summarization of most other writing forms remains an unsolved problem.

Motivated by this, a team at Google Brain investigated an abstractive summarization system dubbed SummAE that’s largely unsupervised, meaning it’s able to generalize from a small amount of training data to unseen textual examples. While it couldn’t summarize beyond single five-sentence paragraphs, the researchers claim it “significantly” improves upon the baseline and represents a “major” step in the direction of human-level performance.

Recommended videosPowered by AnyClip
Go Eat A McRib
Play

Unmute
Duration
0:59
/
Current Time
0:17

Fullscreen
Up Next

NOW PLAYINGGo Eat A McRib
Scientists Discover What Makes 'Water Bears' Virtually Indestructible
Doctor diagnoses his own cancer with an app
There's A Bigger Danger To Pedestrians Than Walking While Distracted
Prince Harry to edit National Geographic's Instagram
The Secret Culprit Of America's Student Debt Crisis
5 Quotes About The Power of Books

The data set and code are freely available on GitHub, along with the configuration settings for the best model.

“As one of the very first works approaching single-document [abstract summarization], we propose a novel neural model — SummAE,” wrote the coauthors. “[We believe it] is therefore desirable to have models capable of automatically summarizing documents abstractively with little to no supervision.”

SummAE contains a denoising autoencoder that encodes (that is, generates numerical representations of) sentences and paragraphs of the target text in a shared space. Guided by a decoder whose input is prepended with a token signaling whether to decode a sentence or a paragraph, the system generates summaries by decoding each sentence from the encoded paragraphs.

The researchers discovered that most traditional approaches to training the auto-encoder resulted in long, multi-sentence summaries. To encourage it to learn higher-level concepts disentangled from their original expression, the team employed two denoising approaches — randomly masking tokens and permuting the order of sentences within paragraphs — that increased the number of training examples substantially. They also experimented with an adversarial critic component that could distinguish between sentences and paragraphs, in addition to two pretraining tasks that encouraged the encoder to learn how sentences narratively followed within a paragraph.

The researchers trained three different variations of SummAE on the ROCStories, a corpus of self-contained, diverse, non-technical, and concise prose. They split the original 98,159 training stories into three separate collections — a training set, a validation set, and a test set — and collected three human summaries each for 500 validation examples and 500 test examples.

After 100,000 training steps with pretraining, the team reports that the best model significantly outperformed a baseline extractive sentence generator on the Recall-Oriented Understudy for Gisting Evaluation (ROUGE), a set of metrics devised to evaluate automatic summarization. Moreover, they say that in a qualitative study involving evaluators recruited through Amazon’s Mechanical Turk, volunteers rated one of the three SummAE models’ summaries “fluent” and “information-relevant” 80% of the time.

“The paragraph reconstructions show some coherence, although with some disfluencies and factual inaccuracies that are common with neural generative models,” wrote the coauthors. “Since the summaries are decoded from the same latent vector as the reconstructions, improving them could lead to more accurate summaries.”
Newsletter

Related Articles

Saudi Press
0:00
0:00
Close
Saudi Arabia and the United States Strengthen Ties Amid Global Developments
Saudi Arabia Hosts Global Conference to Promote Islamic Unity
The Impact of Artificial Intelligence on Education and Child Development
Saudi Arabia Announces Competition for Best Founding Day Outfits
Saudi-EU Food Security Officials Hold Talks to Strengthen Collaboration
Putin Expresses Gratitude to Saudi Crown Prince for Hosting US-Russia Talks
UK and Saudi Arabia Enhance Collaboration in Innovation and Technology
Denmark's Embassy in Riyadh Showcases Danish Cuisine with Saudi Influence
Saudi Artist Salman Al-Amir Unveils 'Tafawut' Exhibition in Riyadh
Saudi Arabia Offers Condolences to Kuwait Following Military Exercise Fatalities
Saudi Ministry of Islamic Affairs Completes Ramadan Preparations in Madinah
Etidal Secretary-General Hosts UN Counter-Terrorism Director in Riyadh
ADNOC Drilling Targets Over $1 Billion in Investments for 2025 Amid Gulf Expansion Plans
Derayah Financial Achieves Remarkable Growth in Saudi Brokerage and Asset Management
Saudi Arabia Shortlists 30 Firms for Mining Licenses in Eastern Province and Tabuk
Saudi Foreign Minister Engages Counterparts at G20 Meeting in Johannesburg
Oil Prices Decline Amid Rising US Inventories
Saudi Arabia's NDMC Plans Green Bond Issuance by 2025
Moody’s Affirms Egypt’s Caa1 Rating Amid Positive Economic Outlook
Oman and Saudi Arabia Strengthen Economic Ties with New Agreements
Saudi Arabia Investments Propel Expansion of Qurayyah Power Plant
Saudi Capital Market Authority Advances SPACs and Direct Listings
Global Energy Leaders Gather in Riyadh for Symposium on Energy Outlooks
Al-Ahsa Region Sees 500% Growth in Tourism as Saudi Arabia Prioritizes Development
Saudi Arabia Advances Entrepreneurial Ecosystem in Al-Ahsa with New Agreement
King Salman Approves Official Saudi Riyal Symbol
Saudi Credit Card Lending Reaches $8.4 Billion Amid Digital Payment Expansion
King Salman Approves Official Symbol for Saudi Riyal
Putin Thanks Saudi Crown Prince for Facilitating U.S.-Russia Discussions
Saudi Foreign Minister Attends G20 Meeting in Johannesburg
Saudi Arabia Prepares for Nationwide Founding Day Celebrations
Inauguration of Hira Park and Walkway Enhances Jeddah's Urban Landscape
Crown Prince Hosts Leaders for Informal Meeting in Riyadh Amid Gaza Rebuilding Plans
Saudi Official Highlights Achievements and Media's Role in National Transformation
Three Expatriate Women Arrested for Prostitution in Riyadh
Saudi Arabia's Diplomatic Evolution Highlighted at Saudi Media Forum
Healthy Eating and Preparation Essential for Ramadan Fasting
Saudi Arabia and Japan Forge Sustainable Textile Partnership
Advanced Limb Surgery Restores Mobility in Pediatric Cancer Patient
Jeddah Event Explores AI's Role in Boosting Saudi Arabia's SME Sector
UN Representative Highlights AI's Role in Perpetuating Gender Stereotypes
Saudi and Jordanian Leaders Discuss Enhanced Security Cooperation in Amman
Saudi British Society Honors Cultural Bridge-Builders at London Gala
Saudi Media Forum 2025 Explores AI's Role in Modern Journalism
Saudi Arabia's Saqer Al-Moqbel Appointed as WTO General Council President for 2025–2026
Saudi Deputy Ministers Engage in Diplomatic Discussions with U.S. and Dutch Officials in Riyadh
Saudi Arabia to Launch Iftar Program in 61 Countries During Ramadan
Saudi Visitors Expected to Spend £942 Million in UK During 2025
Saudi Arabia Gifts Kaaba's Kiswah to Uzbekistan's Center of Islamic Civilization
Digital Cooperation Organization Concludes Fourth General Assembly with Multiple Agreements
×