mavii

mavii AI

I analyzed the results on this page and here's what I found for you…

[2402.06196] Large Language Models: A Survey - arXiv.org

The research area of LLMs, while very recent, is evolving rapidly in many different ways. In this paper, we review some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discuss their characteristics, contributions and limitations. We also give an overview of techniques developed to build, and augment LLMs.

Large Language Models: A Survey - arXiv.org

fine-tuning, and evaluation, review widely used LLM evaluation metrics, and compare the performance of several popular LLMs on a set of representative benchmarks. Finally, we conclude the paper by discussing open challenges and future research directions. I. INTRODUCTION Language modeling is a long-standing research topic, dat-

[2409.04109] Can LLMs Generate Novel Research Ideas? A Large-Scale ...

Recent advancements in large language models (LLMs) have sparked optimism about their potential to accelerate scientific discovery, with a growing number of works proposing research agents that autonomously generate and validate new ideas. Despite this, no evaluations have shown that LLM systems can take the very first step of producing novel, expert-level ideas, let alone perform the entire ...

[2502.09992] Large Language Diffusion Models - arXiv.org

Autoregressive models (ARMs) are widely regarded as the cornerstone of large language models (LLMs). We challenge this notion by introducing LLaDA, a diffusion model trained from scratch under the pre-training and supervised fine-tuning (SFT) paradigm. LLaDA models distributions through a forward data masking process and a reverse process, parameterized by a vanilla Transformer to predict ...

[2307.06435] A Comprehensive Overview of Large Language Models - arXiv.org

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics ...

arxiv.org › abs › 2409.

[2409.04109] Can LLMs Generate Novel Research Ideas? A Large-Scale ...

News

More News

PsyPost on MSN.com · 2d

LLM red teamers: People are hacking AI chatbots just for fun and now researchers have catalogued 35 “jailbreak” techniques

What happens when people push artificial intelligence to its limits—not for profit or malice, but out of curiosity and creativity? A new study published in PLOS One explores the world of “LLM red teamers,

AppleInsider · 3d

How Apple's new Machine Learning research will help Apple Intelligence get smarter

Apple's machine learning researchers have worked on myriad ways to improve Apple Intelligence and other generative AI systems, as its research papers accepted by a major AI conference demonstrate.

ZDNET · 1d

Nvidia's 70+ projects at ICLR show how raw chip power is central to AI's acceleration

The neural net of Fugatto is one developed at Google in 2022 that can operate on "spectrograms," sounds as wave patterns. The original contribution of Nvidia's Rafael Valle and his team is a new dataset and a training regimen that teaches the model to handle complex textual commands.

Slator · 8h

Alibaba and Meta Face Off in Simultaneous AI Translation

Their method, AliBaStr-MT (Alignment-Based Streaming Machine Translation), builds on a pre-trained translation model and adds a small module that helps the model decide when to “read” more input and when to “write” the translation.

arxiv.org › abs › 2502.

[2502.09992] Large Language Diffusion Models - arXiv.org

Discussions

More Discussions

AxiosError: Request failed with status code 401

magazine.sebastianraschka.com › llm-research-papers-the-2024-list

LLM Research Papers: The 2024 List - magazine.sebastianraschka.com

It’s been a very eventful and exciting year in AI research. This is especially true if you are interested in LLMs. I had big plans for this December edition and was planning to publish a new article with a discussion of all my research highlights from 2024.

paperswithcode.com › task › large-language-model

Papers with Code - Large Language Model

Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets. Read previous issues. Subscribe. Join the community ... On top of it, we build vLLM, an LLM serving system that achieves (1) near-zero waste in KV cache memory and (2) flexible sharing of KV cache within and across requests to ...

sebastianraschka.com › blog › 2025 › llm-research-2024

Noteworthy LLM Research Papers of 2024 - sebastianraschka.com

If you’re looking for a broader list of AI research papers, feel free to check out my earlier article (LLM Research Papers: The 2024 List). Happy new year and happy reading! Table of contents. 1. January: Mixtral’s Mixture of Experts Approach. 1.1 Understanding MoE models; 1.2 The relevance of MoE models today; 2. February: Weight ...

hub.athina.ai › top-10-llm-papers-from-february-2025-1

Top 10 LLM Papers from February: Benchmarking, Evaluation

Automating relevance judgments can accelerate research in IR and NLP, making evaluation processes more scalable, efficient, and accessible in low-resource settings. Read Paper Here, Github Here. 6) How to Get Your LLM to Generate Challenging Problems for Evaluation. The rapid evolution of Large Language Models (LLMs) demands new evaluation methods.

researchgate.net › publication › _A_Comprehensive_Overview_of_Large_Language_Models

(PDF) A Comprehensive Overview of Large Language Models - ResearchGate

Finally, the paper summarizes significant findings from LLM research and consolidates essential architectural and training strategies for developing advanced LLMs.

github.com › youssefHosni › Weekly-Top-LLM-Papers

youssefHosni/Weekly-Top-LLM-Papers - GitHub

A curated summary of the most important published LLM & Generative AI papers updated on a weekly basis. Weekly Top LLM Papers In 2025; Weekly Top LLM Papers In 2024

analyticsvidhya.com › blog › 2024 › llms-research-paper-in-january

4 LLMs Research Paper in January 2025 - Analytics Vidhya

TinyLlama follows Microsoft’s phi-2 as the latest addition to the “small” LLM category, with 1.1 billion parameters. It distinguishes itself by being fully open source, providing transparency in the LLM pre-training community. ... I would love to hear your thoughts on the latest research papers on Large Language Models (LLMs) in 2025.

arxiv.org › abs › 2307.

[2307.06435] A Comprehensive Overview of Large Language Models - arXiv.org

ieeexplore.ieee.org › document

A Review on Large Language Models: Architectures, Applications ...

Large Language Models (LLMs) recently demonstrated extraordinary capability in various natural language processing (NLP) tasks including language translation, text generation, question answering, etc. Moreover, LLMs are new and essential part of computerized language processing, having the ability to understand complex verbal patterns and generate coherent and appropriate replies in a given ...

medium.com › @minhle_0210 › lets-read-top-llm-papers-in-personalizing-ai-in-april-2024-cca8f8686d1a

Let’s read top LLM papers in personalizing AI in April 2024

In this article, I summarizes important LLM papers published during April 2024, covering topics such as model reasoning, performance enhancement, or even traits of personality in LLMs. Staying ...

hub.athina.ai › top-10-llm-papers-of-the-week-10

Top 10 LLM Research Papers of the Week: 10th Mar - 17th Mar

Discover the week's top 10 LLM research papers, highlighting AI Agents, RAG techniques and Evaluation of AI Models. 10th March - 17th March. ... Its a great resource to bookmark if you want to stay ahead on the curve of latest research. Access here. Lets dive into this week's papers: 1) A Survey on Trustworthy LLM Agents: Threats and ...

hub.athina.ai › top-10-llm-papers-of-the-week-2

Top 10 LLM Research Papers of the Week: 27 Dec - 3 Jan

As April begins, the AI Agent landscape continues to evolve at an historic pace, with groundbreaking research shaping the future of intelligent systems. In this article, we spotlight the Top 10 Cutting-Edge Research Papers on AI Agents from this week, breaking down key insights, examining their impact, and highlighting their. By Paras Madan 08 ...

allenai.org › blog › paper-finder

Introducing Ai2 Paper Finder | Ai2 - allenai.org

Ai2 Paper Finder is an LLM-powered literature search system that mimics the iterative paper-finding process. ... While summaries are mostly intended to learn about a new topic, paper finding helps you dig deeper into areas you already know. ... We aim to help advance science by supporting all research needs, from paper finding, literature ...

aclanthology.org › 2024.naacl-long.67

Topics, Authors, and Institutions in Large Language Model Research ...

To clarify such questions, we analyze a new dataset of 16,979 LLM-related arXiv papers, focusing on recent trends in 2023 vs. 2018-2022. First, we study disciplinary shifts: LLM research increasingly considers societal impacts, evidenced by 20 × growth in LLM submissions to the Computers and Society sub-arXiv. An influx of new authors – half ...

spectrum.ieee.org › large-language-models-2025

Google Leads With LLMs, Meta and OpenAI Struggle - IEEE Spectrum

The breakthrough research paper on the transformer architecture that underpins large language models came from Google in 2017, ... Meta’s new open-weight LLM does have its strengths. Llama 4 is ...

arxiv.org › abs › 2307.

Topics, Authors, and Institutions in Large Language Model Research ...

Large language models (LLMs) are dramatically influencing AI research, spurring discussions on what has changed so far and how to shape the field's future. To clarify such questions, we analyze a new dataset of 16,979 LLM-related arXiv papers, focusing on recent trends in 2023 vs. 2018-2022. First, we study disciplinary shifts: LLM research increasingly considers societal impacts, evidenced by ...

nature.com › articles › s-025-87862-3

An LLM-based hybrid approach for enhanced automated essay scoring

Cosine similarity is then calculated between a new essay and the test essay dataset, based on their probability transition vectors. This is the probability that, being in a state \(E_i\) at time n ...