mavii AI

I analyzed the results on this page and here's what I found for you…

Noteworthy LLM Research Papers of 2024 - sebastianraschka.com

If you’re looking for a broader list of AI research papers, feel free to check out my earlier article (LLM Research Papers: The 2024 List). Happy new year and happy reading! Table of contents. 1. January: Mixtral’s Mixture of Experts Approach. 1.1 Understanding MoE models; 1.2 The relevance of MoE models today; 2. February: Weight ...

LLM Research Papers: The 2024 List - magazine.sebastianraschka.com

In the meantime, I want to share my running bookmark list of many fascinating (mostly LLM-related) papers I stumbled upon in 2024. It’s just a list, but maybe it will come in handy for those who are interested in finding some gems to read for the holidays.

[2402.06196] Large Language Models: A Survey - arXiv.org

The research area of LLMs, while very recent, is evolving rapidly in many different ways. In this paper, we review some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discuss their characteristics, contributions and limitations. We also give an overview of techniques developed to build, and augment LLMs.

Noteworthy AI Research Papers of 2024 (Part One)

Only a few days into January 2024, the Mistral AI team shared the Mixtral of Experts paper (8 Jan 2024), which described Mixtral 8x7B, a Sparse Mixture of Experts (SMoE) model.. The paper and model were both very influential at the time, as Mixtral 8x7B was (one of) the first open-weight MoE LLMs with an impressive performance: it outperformed Llama 2 70B and GPT-3.5 across various benchmarks.

Research Papers in January 2024 - Sebastian Raschka, PhD

In this WARM: On the Benefits of Weight Averaged Reward Models (Jan 22), researchers propose a weight averaging approach for LLM reward models. ("Reward models" refer to those used in reinforcement learning with human feedback, RLHF, for alignment.). What is weight averaging? Since weight averaging and model merging for LLMs seem to be the most interesting themes in 2024, I want to briefly ...

[2409.04109] Can LLMs Generate Novel Research Ideas? A Large-Scale ...

Recent advancements in large language models (LLMs) have sparked optimism about their potential to accelerate scientific discovery, with a growing number of works proposing research agents that autonomously generate and validate new ideas. Despite this, no evaluations have shown that LLM systems can take the very first step of producing novel, expert-level ideas, let alone perform the entire ...

Papers | Prompt Engineering Guide

LLM Research Findings. LLM Agents; RAG for LLMs; LLM Reasoning; RAG Faithfulness; LLM In-Context Recall; ... Papers. The following are the latest papers (sorted by release date) on prompt engineering for large language models (LLMs). ... (February 2024) Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4 (opens in a ...

Research Papers in February 2024 - Sebastian Raschka, PhD

I'm an LLM Research Engineer with over a decade of experience in artificial intelligence. My work bridges academia and industry, with roles including senior staff at an AI company and a statistics professor. ... Research Papers in February 2024 — A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM Research. RSS Feed ...

LLM Research Recap of 2024, Fine-tuning LLM Judges, Amazon's Nova ...

The text lists recent research papers from 2024 focused on advancements in large language models (LLMs). Topics include improving text generation, memorization, and model efficiency. The papers also explore the use of instruction tuning and retrieval-augmented techniques in enhancing LLM capabilities.

A Review on Large Language Models: Architectures, Applications ...

Large Language Models (LLMs) recently demonstrated extraordinary capability in various natural language processing (NLP) tasks including language translation, text generation, question answering, etc. Moreover, LLMs are new and essential part of computerized language processing, having the ability to understand complex verbal patterns and generate coherent and appropriate replies in a given ...

[2307.06435] A Comprehensive Overview of Large Language Models - arXiv.org

Abstract page for arXiv paper 2307.06435: A Comprehensive Overview of Large Language Models ... With the rapid development of techniques and regular breakthroughs in LLM research, it has become considerably challenging to perceive the bigger picture of the advances in this direction. ... (1,533 KB) Tue, 9 Apr 2024 21:38:33 UTC (1,691 KB) [v10 ...

Papers - LLM Evaluation

DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents . Added on: 26/01/2024. ICML 2024. An extension to the former DyVal. Kaijie Zhu 1,2, Jindong Wang #1, Qinlin Zhao *3, Ruochen Xu 1, Xing Xie 1. 1 Microsoft Research Asia, 2 Institute of Automation, CAS, 3 University of Science and Technology of China (#: Corresponding author)

Topics, Authors, and Institutions in Large Language Model Research ...

To clarify such questions, we analyze a new dataset of 16,979 LLM-related arXiv papers, focusing on recent trends in 2023 vs. 2018-2022. First, we study disciplinary shifts: LLM research increasingly considers societal impacts, evidenced by 20 × growth in LLM submissions to the Computers and Society sub-arXiv. An influx of new authors – half ...

Let’s read top LLM papers in personalizing AI in April 2024

In this article, I summarizes important LLM papers published during April 2024, covering topics such as model reasoning, performance enhancement, or even traits of personality in LLMs. Staying ...

GitHub - azminewasi/Awesome-LLMs-ICLR-24: It is a comprehensive ...

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024. - azminewasi/Awesome-LLMs-ICLR-24

Noteworthy AI Research Papers of 2024 (Part Two)

Readers are probably already well familiar with Meta AI's Llama 3 models and paper, but since these are such important and widely-used models, I want to dedicate the July section to The Llama 3 Herd of Models (July 2024) paper by Grattafiori and colleagues.. What's notable about the Llama 3 model family is the increased sophistication of the pre-training and post-training pipelines compared to ...

My top 10 LLM research papers in 2024 - keywordsai.co

By the end of 2024, I had read 61 research papers — all of which left me with new ideas and deeper insights. I’ve recorded these papers on my Notion page, and I’ll share the full list at the end of this blog. In this first part, I want to spotlight the Top 10 LLM research papers I read in 2024. These papers stood out for their impact ...

Divergent LLM Adoption and Heterogeneous Convergence Paths in Research ...

Our findings reveal substantial disparities in LLM adoption across academic disciplines, gender, native language status, and career stage, alongside a rapid evolution in scholarly writing styles. ... and Heterogeneous Convergence Paths in Research Writing * (April 14, 2025). Cornell SC Johnson College of Business Research Paper Forthcoming ...

Paper Digest: ICLR 2024 Papers & Highlights

To search or review papers within ICLR-2024 related to a specific topic, please use the search by venue (ICLR-2024), review by venue ... Highlight: A fundamental research question is: will LLM-generated misinformation cause more harm than human-written misinformation? We propose to tackle this question from the perspective of detection difficulty.

Top Important LLM Papers for the Week from 29/04 to 05/05

This article summarizes some of the most important LLM papers published during the Fourth Week of April 2024. The papers cover various topics shaping the next generation of language models, from model optimization and scaling to reasoning, benchmarking, and enhancing performance. Keeping up with novel LLM research across these domains will help ...