It’s been a very eventful and exciting year in AI research. This is especially true if you are interested in LLMs. I had big plans for this December edition and was planning to publish a new article with a discussion of all my research highlights from 2024.
Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets. ... On top of it, we build vLLM, an LLM serving system that achieves (1) near-zero waste in KV cache memory and (2) flexible sharing of KV cache within and across requests to ...
If you’re looking for a broader list of AI research papers, feel free to check out my earlier article (LLM Research Papers: The 2024 List). Happy new year and happy reading! Table of contents. 1. January: Mixtral’s Mixture of Experts Approach. 1.1 Understanding MoE models; 1.2 The relevance of MoE models today; 2. February: Weight ...
Automating relevance judgments can accelerate research in IR and NLP, making evaluation processes more scalable, efficient, and accessible in low-resource settings. 6) How to Get Your LLM to Generate Challenging Problems for Evaluation. The rapid evolution of Large Language Models (LLMs) demands new evaluation methods.
Finally, the paper summarizes significant findings from LLM research and consolidates essential architectural and training strategies for developing advanced LLMs.
A curated summary of the most important published LLM & Generative AI papers updated on a weekly basis. Weekly Top LLM Papers In 2025; Weekly Top LLM Papers In 2024
TinyLlama follows Microsoft’s phi-2 as the latest addition to the “small” LLM category, with 1.1 billion parameters. It distinguishes itself by being fully open source, providing transparency in the LLM pre-training community. ... I would love to hear your thoughts on the latest research papers on Large Language Models (LLMs) in 2025.
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics ...
Large Language Models (LLMs) have recently demonstrated extraordinary capability in various natural language processing (NLP) tasks, including language translation, text generation, and question answering. Moreover, LLMs are a new and essential part of computerized language processing, with the ability to understand complex verbal patterns and generate coherent and appropriate replies in a given ...
In this article, I summarize important LLM papers published during April 2024, covering topics such as model reasoning, performance enhancement, and even personality traits in LLMs. Staying ...
Discover the week's top 10 LLM research papers, highlighting AI Agents, RAG techniques, and Evaluation of AI Models. 10th March - 17th March. ... It's a great resource to bookmark if you want to stay ahead of the curve on the latest research. Access here. Let's dive into this week's papers: 1) A Survey on Trustworthy LLM Agents: Threats and ...
As April begins, the AI Agent landscape continues to evolve at a historic pace, with groundbreaking research shaping the future of intelligent systems. In this article, we spotlight the Top 10 Cutting-Edge Research Papers on AI Agents from this week, breaking down key insights, examining their impact, and highlighting their. By Paras Madan 08 ...
Ai2 Paper Finder is an LLM-powered literature search system that mimics the iterative paper-finding process. ... While summaries are mostly intended to learn about a new topic, paper finding helps you dig deeper into areas you already know. ... We aim to help advance science by supporting all research needs, from paper finding, literature ...
To clarify such questions, we analyze a new dataset of 16,979 LLM-related arXiv papers, focusing on recent trends in 2023 vs. 2018-2022. First, we study disciplinary shifts: LLM research increasingly considers societal impacts, evidenced by 20× growth in LLM submissions to the Computers and Society sub-arXiv. An influx of new authors – half ...
The breakthrough research paper on the transformer architecture that underpins large language models came from Google in 2017, ... Meta’s new open-weight LLM does have its strengths. Llama 4 is ...
Large language models (LLMs) are dramatically influencing AI research, spurring discussions on what has changed so far and how to shape the field's future. To clarify such questions, we analyze a new dataset of 16,979 LLM-related arXiv papers, focusing on recent trends in 2023 vs. 2018-2022. First, we study disciplinary shifts: LLM research increasingly considers societal impacts, evidenced by ...
Cosine similarity is then calculated between a new essay and the test essay dataset, based on their probability transition vectors. This is the probability that, being in a state \(E_i\) at time n ...
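The comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the excerpt does not say how the probability transition vectors are built, so the sketch assumes each essay is reduced to a sequence of discrete states, a first-order transition matrix is estimated from adjacent pairs, and the matrix is flattened into a vector. The state labels and the helper names `transition_vector` and `cosine_similarity` are hypothetical and do not come from the source.

```python
import math
from collections import Counter


def transition_vector(states, alphabet):
    """Flatten an estimated first-order transition matrix into one vector.

    Assumption: entry (i, j) estimates P(next = E_j | current = E_i)
    from counts of adjacent state pairs in `states`.
    """
    counts = Counter(zip(states, states[1:]))
    vector = []
    for src in alphabet:
        row_total = sum(counts[(src, dst)] for dst in alphabet)
        for dst in alphabet:
            # Rows for states that never occur as a source stay all-zero.
            vector.append(counts[(src, dst)] / row_total if row_total else 0.0)
    return vector


def cosine_similarity(u, v):
    """Cosine of the angle between u and v; 0.0 if either vector is all-zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical state alphabet and essays encoded as state sequences.
alphabet = ["A", "B", "C"]
new_essay = ["A", "B", "C", "A", "B", "C"]
same_pattern = ["A", "B", "C", "A", "B"]
reversed_pattern = ["C", "B", "A", "C", "B", "A"]

vec_new = transition_vector(new_essay, alphabet)
sim_same = cosine_similarity(vec_new, transition_vector(same_pattern, alphabet))
sim_reversed = cosine_similarity(vec_new, transition_vector(reversed_pattern, alphabet))
```

Here `sim_same` comes out at 1 (up to floating-point rounding) because both sequences share the same transition pattern, while `sim_reversed` is 0 because the two essays have no transitions in common.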