
Distilling complex research findings into accessible summaries is a core skill in science journalism, and it is often seen as an ideal application for large language models such as ChatGPT. A recent informal, year-long study by the American Association for the Advancement of Science (AAAS), however, has raised questions about how well ChatGPT performs this role.
The AAAS set out to test whether ChatGPT could generate concise "news brief" summaries like those its dedicated SciPak team produces for the journal Science and for platforms like EurekAlert. These briefs are designed to present the essential information about a study, including its objectives, methodology, and broader context, to help journalists write their own articles.
In a recent blog post and accompanying white paper, the AAAS journalists concluded that although ChatGPT could roughly mimic the format of SciPak-style briefs, its output often prioritized simplicity at the expense of accuracy. As a result, the summaries frequently required extensive fact-checking by experienced SciPak writers. Abigail Eisenstadt, a writer at AAAS, said that while these AI tools show promise as potential aids for science journalists, they are not yet ready for mainstream use within the SciPak team.
From December 2023 to December 2024, AAAS researchers asked ChatGPT to summarize up to two scientific papers each week, using prompts that varied in specificity. They focused on papers with challenging elements such as technical jargon, controversial findings, groundbreaking insights, human subjects, or unconventional formats. The summaries were generated with the latest GPT models available during the study period, primarily GPT-4 and GPT-4o.
In total, 64 papers were summarized, with the results assessed both quantitatively and qualitatively by the same SciPak writers who had originally briefed those papers. The researchers acknowledged a limitation in their design, noting that it could not account for potential human biases, which might be particularly pronounced among journalists scrutinizing a tool that threatens to encroach upon their professional territory.