[2402.06196] Large Language Models: A Survey - arXiv.org

The research area of LLMs, while very recent, is evolving rapidly in many different ways. In this paper, we review some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discuss their characteristics, contributions and limitations. We also give an overview of techniques developed to build and augment LLMs.

[2409.04109] Can LLMs Generate Novel Research Ideas? A Large-Scale ...

Recent advancements in large language models (LLMs) have sparked optimism about their potential to accelerate scientific discovery, with a growing number of works proposing research agents that autonomously generate and validate new ideas. Despite this, no evaluations have shown that LLM systems can take the very first step of producing novel, expert-level ideas, let alone perform the entire ...

[2502.09992] Large Language Diffusion Models - arXiv.org

Autoregressive models (ARMs) are widely regarded as the cornerstone of large language models (LLMs). We challenge this notion by introducing LLaDA, a diffusion model trained from scratch under the pre-training and supervised fine-tuning (SFT) paradigm. LLaDA models distributions through a forward data masking process and a reverse process, parameterized by a vanilla Transformer to predict ...

LLM Research Papers: The 2024 List - magazine.sebastianraschka.com

It’s been a very eventful and exciting year in AI research. This is especially true if you are interested in LLMs. I had big plans for this December edition and was planning to publish a new article with a discussion of all my research highlights from 2024.

Papers with Code - Large Language Model

Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets. Read previous issues. Subscribe. Join the community ... On top of it, we build vLLM, an LLM serving system that achieves (1) near-zero waste in KV cache memory and (2) flexible sharing of KV cache within and across requests to ...

Noteworthy LLM Research Papers of 2024 - sebastianraschka.com

If you’re looking for a broader list of AI research papers, feel free to check out my earlier article (LLM Research Papers: The 2024 List). Happy new year and happy reading! Table of contents. 1. January: Mixtral’s Mixture of Experts Approach. 1.1 Understanding MoE models; 1.2 The relevance of MoE models today; 2. February: Weight ...

Top 10 LLM Papers from February: Benchmarking, Evaluation

Automating relevance judgments can accelerate research in IR and NLP, making evaluation processes more scalable, efficient, and accessible in low-resource settings. Read Paper Here, Github Here. 6) How to Get Your LLM to Generate Challenging Problems for Evaluation. The rapid evolution of Large Language Models (LLMs) demands new evaluation methods.

(PDF) A Comprehensive Overview of Large Language Models - ResearchGate

Finally, the paper summarizes significant findings from LLM research and consolidates essential architectural and training strategies for developing advanced LLMs.

youssefHosni/Weekly-Top-LLM-Papers - GitHub

A curated summary of the most important published LLM & Generative AI papers updated on a weekly basis. Weekly Top LLM Papers In 2025; Weekly Top LLM Papers In 2024

4 LLMs Research Paper in January 2025 - Analytics Vidhya

TinyLlama follows Microsoft’s phi-2 as the latest addition to the “small” LLM category, with 1.1 billion parameters. It distinguishes itself by being fully open source, providing transparency in the LLM pre-training community. ... I would love to hear your thoughts on the latest research papers on Large Language Models (LLMs) in 2025.

[2307.06435] A Comprehensive Overview of Large Language Models - arXiv.org

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics ...

A Review on Large Language Models: Architectures, Applications ...

Large Language Models (LLMs) recently demonstrated extraordinary capability in various natural language processing (NLP) tasks including language translation, text generation, question answering, etc. Moreover, LLMs are a new and essential part of computerized language processing, having the ability to understand complex verbal patterns and generate coherent and appropriate replies in a given ...

Let’s read top LLM papers in personalizing AI in April 2024

In this article, I summarize important LLM papers published during April 2024, covering topics such as model reasoning, performance enhancement, or even traits of personality in LLMs. Staying ...

Top 10 LLM Research Papers of the Week: 10th Mar - 17th Mar

Discover the week's top 10 LLM research papers, highlighting AI Agents, RAG techniques and Evaluation of AI Models. 10th March - 17th March. ... It's a great resource to bookmark if you want to stay ahead of the curve on the latest research. Access here. Let's dive into this week's papers: 1) A Survey on Trustworthy LLM Agents: Threats and ...

Top 10 LLM Research Papers of the Week: 27 Dec - 3 Jan

As April begins, the AI Agent landscape continues to evolve at an historic pace, with groundbreaking research shaping the future of intelligent systems. In this article, we spotlight the Top 10 Cutting-Edge Research Papers on AI Agents from this week, breaking down key insights, examining their impact, and highlighting their. By Paras Madan 08 ...

Introducing Ai2 Paper Finder | Ai2 - allenai.org

Ai2 Paper Finder is an LLM-powered literature search system that mimics the iterative paper-finding process. ... While summaries are mostly intended to learn about a new topic, paper finding helps you dig deeper into areas you already know. ... We aim to help advance science by supporting all research needs, from paper finding, literature ...

Topics, Authors, and Institutions in Large Language Model Research ...

To clarify such questions, we analyze a new dataset of 16,979 LLM-related arXiv papers, focusing on recent trends in 2023 vs. 2018-2022. First, we study disciplinary shifts: LLM research increasingly considers societal impacts, evidenced by 20× growth in LLM submissions to the Computers and Society sub-arXiv. An influx of new authors – half ...

Google Leads With LLMs, Meta and OpenAI Struggle - IEEE Spectrum

The breakthrough research paper on the transformer architecture that underpins large language models came from Google in 2017, ... Meta’s new open-weight LLM does have its strengths. Llama 4 is ...

Topics, Authors, and Institutions in Large Language Model Research ...

Large language models (LLMs) are dramatically influencing AI research, spurring discussions on what has changed so far and how to shape the field's future. To clarify such questions, we analyze a new dataset of 16,979 LLM-related arXiv papers, focusing on recent trends in 2023 vs. 2018-2022. First, we study disciplinary shifts: LLM research increasingly considers societal impacts, evidenced by ...

An LLM-based hybrid approach for enhanced automated essay scoring

Cosine similarity is then calculated between a new essay and the test essay dataset, based on their probability transition vectors. This is the probability that, being in a state \(E_i\) at time n ...
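
As a rough illustration of the comparison step described in this snippet, the sketch below computes cosine similarity between one vector and a small set of reference vectors. It assumes each essay has already been reduced to a flat vector of transition probabilities; the vectors, the names `new_essay` and `reference_essays`, and the zero-vector convention are hypothetical and are not taken from the paper.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Assumption: treat an all-zero vector as having no similarity.
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical flattened transition-probability vectors, for illustration only.
new_essay = [0.5, 0.5, 0.0, 1.0]
reference_essays = {
    "essay_a": [0.5, 0.5, 0.0, 1.0],
    "essay_b": [0.0, 1.0, 1.0, 0.0],
}

# Score the new essay against every reference essay and pick the closest one.
scores = {
    name: cosine_similarity(new_essay, vec)
    for name, vec in reference_essays.items()
}
best_match = max(scores, key=scores.get)
```

Because cosine similarity depends only on direction, two essays whose vectors differ by a constant scale factor would score as identical; whether that is desirable depends on how the vectors are built.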