News

AppleInsider · 3d

How Apple's new Machine Learning research will help Apple Intelligence get smarter

Apple's machine learning researchers have worked on myriad ways to improve Apple Intelligence and other generative AI systems, as its research papers accepted by a major AI conference demonstrate.
Ars Technica · 28d

Why do LLMs make stuff up? New research peers under the hood.

Now, new research from Anthropic is exposing at least some of the inner neural network "circuitry" that helps an LLM decide when to take a stab at a (perhaps hallucinated) response versus when to refuse an answer in the first place.
PsyPost on MSN.com · 2d

LLM red teamers: People are hacking AI chatbots just for fun and now researchers have catalogued 35 “jailbreak” techniques

What happens when people push artificial intelligence to its limits—not for profit or malice, but out of curiosity and creativity? A new study published in PLOS One explores the world of “LLM red teamers,
THE Journal · 7d

AI 'Microscope' Reveals the Hidden Mechanics of LLM Thought

Anthropic has introduced new research tools designed to provide a rare glimpse into the hidden reasoning processes of advanced language models — like a 'microscope' for AI.
ZDNET · 1d

Nvidia's 70+ projects at ICLR show how raw chip power is central to AI's acceleration

The neural net of Fugatto is one developed at Google in 2022 that can operate on "spectrograms," sounds as wave patterns. The original contribution of Nvidia's Rafael Valle and his team is a new dataset and a training regimen that teaches the model to handle complex textual commands.
Slator · 14h

Alibaba and Meta Face Off in Simultaneous AI Translation

Their method, AliBaStr-MT (Alignment-Based Streaming Machine Translation), builds on a pre-trained translation model and adds a small module that helps the model decide when to “read” more input and when to “write” the translation.
Icon for www.cs.utexas.eduDepartment of Computer Science - University of Texas at Austin · 10d

John Simon Guggenheim Foundation Names Computer Scientist a 2025 Fellow

The John Simon Guggenheim Memorial Foundation has named Swarat Chaudhuri, a professor of computer science at The University of Texas at Austin, among its 100th class of Guggenheim Fellows. The fellowship offers support to exceptional individuals in pursuit of scholarship in any field of knowledge and creation in any art form.
College of Computing - Georgia Tech · 8d

Q&A: Computer Science Major Joni Isbell is Charting Her Own Research Course

Joni Isbell is a third-year computer science major studying cross-language alignment with School of Modern Language Assistant Professor Hongchen Wu. She also studies LLM suspense detection with Interactive Computing Professor Mark Riedl. The following is part of a series of Q&As from the Undergraduate Research Opportunities program.
South China Morning Post · 1d

DeepSeek opens roles for ‘product and design’ as start-up keeps mum on new AI model

This marks the first time DeepSeek has published product-related job openings, as it has largely focused on fundamental AI model research.
01net · 1d

NTT Scientists Present Breakthrough Research on AI Deep Learning at ICLR 2025

NTT Research and NTT R&D co-authored papers explore LLMs’ uncertain and open-ended nature, the “emergence” phenomenon, In-Context Learning and more News Highlights: Nine papers presented at esteemed international conference by NTT Research
Slator · 3d

Get Ready for MT Summit 2025: Highlights and Expectations

Gender inclusivity and bias mitigation in MT is another key theme. The 3rd International Workshop on Gender-Inclusive Translation Technologies (GITT 2025) will feature a solid set of submissions and a keynote talk on inclusive language in localization with AI technologies.
Icon for www.the-scientist.comThe Scientist · Feb 19, 2025

Detection or Deception: The Double-Edged Sword of AI in Research Misconduct

A handful of papers recently published by separate research groups had all linked the expression ... Just a couple of years later, ChatGPT, OpenAI’s large language model (LLM), was released. Now, anyone can feed the virtual writing assistant successive ...