mavii

Videos

Video thumbnail for DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?

DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?

In this video, we dive into the groundbreaking DeepSeek-R1 research paper, titled "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning". This paper introduces the models DeepSeek-R1-Zero and DeepSeek-R1, open-source reasoning models that rivals the performance of top-tier models like OpenAI's o1! Here's a quick ...

What Do LLM Benchmarks Actually Tell Us? (+ How to Run Your Own)

Interpreting and running standardized language model benchmarks and evaluation datasets for both generalized and task specific performance assessments! Resources: lm-evaluation-harness: https://github.com/EleutherAI/lm-evaluation-harness lm-evaluation-harness setup script: https://drive.google.com/file/d/1oWoWSBUdCiB82R-8m52nv_-5pylXEcDp/view ...

Paper: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper reading in the Discord group. All the lecture was improvised. Join the group: https://discord.gg/JRKsaNbhCg Link to paper: https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf

LLM Course – Build a Semantic Book Recommender (Python, OpenAI, LangChain, Gradio)

Discover how to build an intelligent book recommendation system using the power of large language models and Python. Learn to transform book descriptions into mathematical representations that enable precise content-based matching. By the end of this course, you'll have built a recommendation engine that helps readers discover their next ...

Understanding DeepSeek LLM Research Paper | How DeepSeek LLM Model Solves Problems | Open Source AI

Let’s explore the DeepSeek LLM research paper! Discover how DeepSeek, an open-source AI model, processes information to solve complex problems in math, coding, and more: https://patentbusinesslawyer.com/deepseek-api-documentation-guide-building-innovations/ Trained on an astonishing 2 trillion words, DeepSeek not only surpasses models like ...

DAPO: An Open-Source LLM Reinforcement Learning System at Scale (Paper Walkthrough)

📖Paper: https://dapo-sia.github.io/static/pdf/dapo_paper.pdf 🐈‍⬛Github: https://github.com/BytedTsinghua-SIA/DAPO 🏫Institutes: ByteDance Seed, Tsinghua University LLM Reinforcement Learning: DAPO Unlocks the Black Box! This research introduces the Decoupled Clip and Dynamic Sampling Policy Optimization (DAPO) algorithm, an open ...

Lecture 2: Large Language Models (LLM) Basics

In this lecture, we understand the basics of LLMs: (1) Introduction (2) Significance of the word “Large” in Large Language Models (3) LLMs vs Earlier NLP Models (4) LLM Secret Sauce: Transformers (5) Difference between the terminologies: LLM vs GenAI vs DL vs ML vs AI (6) Applications of LLMs The key reference book which this video series ...

The 17 most awesome NLP and LLM research papers from last the 12 months!

This is a rundown of the best NLP, Transformers and LLM papers of 2024. The papers in this video cover a wide range of disciplines - from core deep learning research, novel tokenization schemes, resurrecting RNNs, building on the Mamba architecture, to the latest prompting techniques for efficient retrieval, continual learning, and maximizing ...

Unstract - How to extract data from PDFs using LLMs

This video is a step-by-step easy demo of Unstract, an open-source, no-code platform that helps you automate complex business processes involving long, complicated documents with a human in the loop by leveraging LLMs. Unstract goes beyond what current IDP (Intelligent Document Processing) and RPA (Robotic Process Automation) systems are ...

Breaking Down & Testing FIVE LLM Agent Architectures - (Reflexion, LATs, P&E, ReWOO, LLMCompiler)

Large Language Model Agents have taken over LLM and Artificial Intelligence application design by storm, so this time we check out and simplify six main concepts and five popular papers documenting ways to set up language model based agents, as well as directly testing examples. Resources - @LangChain Agent Tutorials & Code - https://www ...

GraphRAG: LLM-Derived Knowledge Graphs for RAG

Watch my colleague Jonathan Larson present on GraphRAG! GraphRAG is a research project from Microsoft exploring the use of knowledge graphs and large language models for enhanced retrieval augmented generation. It is an end-to-end system for richly understanding text-heavy datasets by combining text extraction, network analysis, LLM prompting ...

LLM Interpretability: Exploring the Latest Research from OpenAI and Anthropic

Join us as we discuss the latest research from OpenAI and Anthropic. We’re excited to chat about this significant step forward in understanding how LLMs work and the implications it has for deeper understanding of the neural activity of language models. Hope you can join the conversation! Open AI’s Paper: https://openai.com/index/extracting ...

Beyond the Basics: The Role of LLM in Modern Threat Intelligence

Threat intelligence is replete with challenges, necessitating a large experience, knowledge, and techniques to really understand the threat landscape, the TTPs, and to accurately track threat actors. Given this context, it is crucial to innovate and introduce the tools and techniques to both the current and next generation of analysts who stand ...

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters (Paper)

How can one best use extra FLOPS at test time? Paper: https://arxiv.org/abs/2408.03314 Abstract: Enabling LLMs to improve their outputs by using more test-time computation is a critical step towards building generally self-improving agents that can operate on open-ended natural language. In this paper, we study the scaling of inference-time ...

I tested NotebookLM against my small, private LLM research assistant. The difference is amazing.

I tested my local, small LLM connected to my Zotero database in Open WebUI against NotebookLM. The results were really, truly surprising. Please Like and Subscribe to support the channel! @LearnMetaAnalysis I tested Granite 3.1-2b and Granite 3.1-8b vs NotebookLM. I am now more impressed than ever! If you find Granite hallucinating with the ...

Marker:Get Your PDFs Ready for RAG & LLMs|High Accuracy Open-Source Tool #ai #llm #pdf #generativeai

PDFs are essential in business, academics, and more for their consistent formatting, but extracting content can be tricky, especially with images, tables, and formulas. This is a key step in preparing text for RAG (Retrieval-Augmented Generation) applications and language models (LLMs). In this video, we’ll show you how converting PDFs to ...

PDF to JSON: LLM-Powered Data Extraction In Python

This video provides a step-by-step guide to help you learn how to use Large Language Models (LLMs) to extract data from PDFs and convert it into JSON format. The tutorial covers prompt engineering techniques and practical implementation steps. ℹ️ DIFFERENT CHAPTERS 0:00 - Introduction 00:42 - Use case workflow 1:56 - Usec case data and ...

Building Brain-Like Memory for AI | LLM Agent Memory Systems

Implementing multiple memory systems into Language Model Agents! Resources: Agentic Memory Repo - https://github.com/ALucek/agentic-memory/tree/main Types of Memory - https://www.psychologytoday.com/us/basics/memory/types-of-memory Cognitive Architectures for Language Agents - https://arxiv.org/pdf/2309.02427 A Survey on the Memory Mechanism of ...

Evaluate LLMs with Language Model Evaluation Harness

In this tutorial, I delve into the intricacies of evaluating large language models (LLMs) using the versatile Evaluation Harness tool. Explore how to rigorously test LLMs across diverse datasets and benchmarks, including HellaSWAG, TruthfulQA, Winogrande, and more. This video features the LLaMA 3 model by Meta AI and demonstrates step-by-step ...

YouTube

· May 12, 2024

Video thumbnail for AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

05:06

microsoft.com › research

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

We present AutoGen, an open-source framework that allows developers to build LLM applications by composing multiple agents to converse with each other to accomplish tasks. AutoGen agents are customizable, conversable, and can operate in various modes that employ combinations of LLMs, human inputs, and tools. It also enables developers to create ...

The 10 Most Cited AI Research Papers of 2024

Check out HubSpot's FREE AI Prompt Library Now! https://clickhubspot.com/h14h In this video, I will be going through (more than) 10 research papers in the field of AI/ML with the most citations in the year 2024. My AI papers newsletter https://mail.bycloud.ai/ Check out my Patreon for the full list https://www.patreon.com/c/bycloud https://www ...

8 Must Read LLM Papers | Add to your Library

Large Language Models: A Survey https://arxiv.org/pdf/2402.06196.pdf Andrej Karpathy LLM Paper Reading List for LLM Mastery https://medium.com/towards-artificial-intelligence/andrej-karpathy-llm-paper-reading-list-for-llm-mastery-89e751ad0cc1?sk=de202ac6d46f4d13343458fe6e900eb0 ...

How I use LLMs

The example-driven, practical walkthrough of Large Language Models and their growing list of related features, as a new entry to my general audience series on LLMs. In this more practical followup, I take you through the many ways I use LLMs in my own life. Chapters 00:00:00 Intro into the growing LLM ecosystem 00:02:54 ChatGPT interaction ...

Does Prompt Formatting Have Any Impact on LLM Performance? | #ai #2024 #genai

Paper: https://arxiv.org/pdf/2411.10541.pdf This research paper investigates the impact of prompt formatting on the performance of OpenAI's GPT language models. The authors tested various formats (plain text, Markdown, JSON, YAML) across multiple tasks and datasets, finding significant performance variations, especially in smaller models like ...

Paper Explained | GNN boosts LLM accuracy via Contrastive Learning

Support me on Patreon where you can tell me what AI paper you want me to cover next! https://www.patreon.com/jacksee/membership Paper: LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings Conclusion: This paper introduces TEA-GLM, a framework that enhances zero-shot learning in graph machine learning by ...

Google’s Med-Gemini Multimodal LLM: The Best Medical AI Model

Join me as I explore the ground-breaking advancements of Med-Gemini in the field of medical AI. In this video, I dive into some of the key findings and insights from the latest research, highlighting how Med-Gemini is setting new benchmarks in clinical reasoning, multimodal understanding, and long-context processing. Discover how these ...

The latest LLM research shows how they are getting SMARTER and FASTER.

System Design Course at InterviewReady: https://interviewready.io/ This is how LLMs are scaling up their test compute time to deliver better results. 00:00 Agenda 00:20 Scaling Law 1.0 01:48 Using smaller weights 03:44 Increasing model training 06:05 Google Titans 07:40 What's after transformers? 08:45 Neuromorphic Computing References: 1.58 ...

Beginner's Guide to LayoutLM Research Paper Explained | Document AI | NLP | CV | OCR #ai

LayoutLM: Pre-training of Text and Layout for Document Image Understanding," authored by Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, and Ming Zhou. The paper was presented at the Conference on Empirical Methods in Natural Language Processing (EMNLP) in 2020. LayoutLM extends the pre-training of language models to incorporate both ...

Developing an LLM: Building, Training, Finetuning

REFERENCES: 1. Build an LLM from Scratch book: https://amzn.to/4fqvn0D 2. Build an LLM from Scratch repo: https://github.com/rasbt/LLMs-from-scratch 3. Slides: https://sebastianraschka.com/pdf/slides/2024-build-llms.pdf 4. LitGPT: https://github.com/Lightning-AI/litgpt 5. TinyLlama pretraining: https://lightning.ai/lightning-ai/studios/pretrain ...

LLM Evaluation Basics: Datasets & Metrics

This is an introduction to evaluating Large Language Models (LLMs), which covers what a dataset is, how we measure performance, and how automatic and human evaluation are done.

Paper Explained | LLM boosts GNN Accuracy via Knowledge Distillation

Support me on Patreon where you can tell me what AI paper you want me to cover next! https://www.patreon.com/jacksee/membership Paper: Large Language Model Meets Graph Neural Network in Knowledge Distillation Conclusion: In this paper, we propose a novel LLM-to-GNN knowledge distillation framework termed LinguGKD, which integrates the semantic ...

Evolving Deeper LLM Thinking (Paper Walkthrough)

📖Paper: https://arxiv.org/abs/2501.09891 👥Authors: Kuang-Huei Lee, Ian Fischer, Yueh-Hua Wu, Dave Marwood, Shumeet Baluja, Dale Schuurmans, Xinyun Chen 🏫Institutes: Google DeepMind, UC San Diego, University of Alberta LLMs Evolving: Mind Evolution Makes Problem Solving a Breeze!🌀 This research introduces Mind Evolution, an ...

Understanding all about LLM with Llama 3.1 Paper

Join us as we unravel the complexities of large language models (LLM) by going through the Llama 3.1 paper! Learn how this groundbreaking research is shaping the future of Large Language Models (LLMs) while utilizing non-proprietary data for unparalleled performance. Whether you're a tech enthusiast or a seasoned AI professional, this video is ...

Research Paper Recommendation System and Subject Area Prediction Deep Learning LLM | ArXiv Data

#deeplearning #largelanguagemodels #machinelearning Source Code: https://github.com/611noorsaeed/Research-Papers-Recommendation-System-and-Subject-Area-Prediction-Using-Deep-Learning-and-LLMS/blob/main/Research Paper recommendation and subject area prediction using sentence transformer.ipynb Download Save Files (Models): https://www.kaggle.com ...

I Trained an LLM to Think Deeper (Here's How)

Turns out reinforcement learning is all you need Check out my prior video on RL: https://youtu.be/qTY4Rr-x5q0?si=pgTpw9r9xwkuZJM6 Resources: Code: https://github.com/ALucek/GRPO-Training/tree/main Model: https://huggingface.co/AdamLucek/Qwen2.5-3B-Instruct-GRPO-2K-GSM8K DeepSeek-R1 Paper: https://arxiv.org/pdf/2501.12948 DeepSeek Math Paper ...

GRPO 2.0? DAPO LLM Reinforcement Learning Explained

In this video, we break down DAPO: An Open-Source LLM Reinforcement Learning System at Scale — a new research paper from ByteDance that introduces DAPO (Decoupled Clip and Dynamic sAmpling Policy Optimization), a powerful reinforcement learning (RL) algorithm built on GRPO (Grouped Relative Policy Optimization). DAPO tackles key challenges in ...

CodeCoR An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation

A detailed breakdown of the AI research paper: CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation by Authors: Ruwei Pan, Hongyu Zhang, Chao Liu. Read the full article: https://arxiv.org/abs/2501.07811 Check out our free AI courses: https://theministryofai.org/courses-2/

LLM Numerical Reasoning: Strengths, Limits, and Number Sense

This paper, "Large Language Models in Numberland," examines how well large language models (LLMs) understand and reason with numbers by introducing a test called "Numberland". The test includes problems ranging from basic math to more complex tasks like checking if a number is prime and playing the game of 24. The researchers found that while ...

LangChain intro - train LLM on set of pdf files

Train a Large Language Model (LLM) on a set of pdf files to then ask questions about the content. In this example, we are training on instruction documents on how to file taxes, to then ask specific questions about tax filings. Learn how to use the LangChain package to build/train the appropriate index. code notebook: https://github.com ...

Agent Laboratory: Using LLM Agents as Research Assistants

reviews the research paper "Agent Laboratory: Using LLM Agents as Research Assistants" by Schmidgall et al. Agent Laboratory is a novel research framework that leverages Large Language Model (LLM) agents to automate various stages of the research process, from idea exploration to report generation. The goal of the project is to empower ...

Survey on Evaluation of LLM-based Agents (Mar 2025)

Title: Survey on Evaluation of LLM-based Agents (Mar 2025) Link: http://arxiv.org/abs/2503.16416v1 Date: March 2025 Summary: This paper provides the first comprehensive survey of evaluation methodologies for increasingly capable LLM-based agents. The survey analyzes evaluation benchmarks and frameworks across agent capabilities, application ...

Multi-LLM-Agent Systems: Techniques and Business Perspectives

This research paper explores multi-LLM-agent systems (MLAS), a new paradigm in artificial intelligence where multiple large language models (LLMs) act as autonomous agents, collaborating to solve complex tasks. The authors discuss the technical aspects of MLAS, including architecture, communication protocols, and agent training methods, while ...

Research Methodology || Important Questions || LLM-II || Kurukshetra University

Research Methodology || Important Questions || LLM-II || Kurukshetra University In this vlog, important questions on the subject of Research Methodology of LL.M IInd year of Kurukshetra University have been shared. This kind of vlog on other subjects have also been made available in the playlist of LL.M on this channel. Do subscribe, like and ...

Longish Term Paper | How to write LTP | Research Paper , LLB , LLM , Research Students , Ph.D

Video contains detailed method of writing the longish term paper.

A Survey of Techniques for Maximizing LLM Performance

Join us for a comprehensive survey of techniques designed to unlock the full potential of Language Model Models (LLMs). Explore strategies such as fine-tuning, RAG (Retrieval-Augmented Generation), and prompt engineering to maximize LLM performance. Speakers: John Allard Engineering Lead, Fine-tuning Product Team at @OpenAI Colin Jarvis ...

The Stochastic Parrot on LLM’s Shoulder: A Summative Assessment of Physical Concept Understanding

The study "A Summative Assessment of Physical Concept Understanding" checks if **large language models (LLMs) truly grasp physical concepts** like gravity. The researchers used a method called summative assessment, which involves testing different levels of understanding. The assessment, called PHYSICO, has two parts: one tests basic knowledge ...

LLM Explained | What is LLM

Simple and easy explanation of LLM or Large Language Model in less than 5 minutes. In this short video, you will build an intuition of how a large language model works using animation and simple story telling. This is the explanation that even a high school student can understand easily. Simple Explanation of Neural Network: https://www.youtube ...

YouTube

· Aug 22, 2023

mavii

Videos

DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI?

What Do LLM Benchmarks Actually Tell Us? (+ How to Run Your Own)

Paper: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

LLM Course – Build a Semantic Book Recommender (Python, OpenAI, LangChain, Gradio)

Understanding DeepSeek LLM Research Paper | How DeepSeek LLM Model Solves Problems | Open Source AI

Marker: This Open-Source Tool will make your PDFs LLM Ready

Build a Document Summarization App using LLM on CPU: No OpenAI ❌

PDF Summary with LLMs in Python - LangChain Tutorial

DAPO: An Open-Source LLM Reinforcement Learning System at Scale (Paper Walkthrough)

Lecture 2: Large Language Models (LLM) Basics

The 17 most awesome NLP and LLM research papers from last the 12 months!

Unstract - How to extract data from PDFs using LLMs

Breaking Down & Testing FIVE LLM Agent Architectures - (Reflexion, LATs, P&E, ReWOO, LLMCompiler)

GraphRAG: LLM-Derived Knowledge Graphs for RAG

LLM Interpretability: Exploring the Latest Research from OpenAI and Anthropic

Beyond the Basics: The Role of LLM in Modern Threat Intelligence

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters (Paper)

I tested NotebookLM against my small, private LLM research assistant. The difference is amazing.

Marker:Get Your PDFs Ready for RAG & LLMs|High Accuracy Open-Source Tool #ai #llm #pdf #generativeai

PDF to JSON: LLM-Powered Data Extraction In Python

Building Brain-Like Memory for AI | LLM Agent Memory Systems

Evaluate LLMs with Language Model Evaluation Harness

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

The 10 Most Cited AI Research Papers of 2024

8 Must Read LLM Papers | Add to your Library

How I use LLMs

Does Prompt Formatting Have Any Impact on LLM Performance? | #ai #2024 #genai

Paper Explained | GNN boosts LLM accuracy via Contrastive Learning

Google’s Med-Gemini Multimodal LLM: The Best Medical AI Model

The latest LLM research shows how they are getting SMARTER and FASTER.

Beginner's Guide to LayoutLM Research Paper Explained | Document AI | NLP | CV | OCR #ai

Developing an LLM: Building, Training, Finetuning

LLM Evaluation Basics: Datasets & Metrics

Paper Explained | LLM boosts GNN Accuracy via Knowledge Distillation

Evolving Deeper LLM Thinking (Paper Walkthrough)

Understanding all about LLM with Llama 3.1 Paper

Research Paper Recommendation System and Subject Area Prediction Deep Learning LLM | ArXiv Data

I Trained an LLM to Think Deeper (Here's How)

GRPO 2.0? DAPO LLM Reinforcement Learning Explained

CodeCoR An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation

LLM Numerical Reasoning: Strengths, Limits, and Number Sense

LangChain intro - train LLM on set of pdf files

Agent Laboratory: Using LLM Agents as Research Assistants

Survey on Evaluation of LLM-based Agents (Mar 2025)

Multi-LLM-Agent Systems: Techniques and Business Perspectives

Research Methodology || Important Questions || LLM-II || Kurukshetra University

Longish Term Paper | How to write LTP | Research Paper , LLB , LLM , Research Students , Ph.D

A Survey of Techniques for Maximizing LLM Performance

The Stochastic Parrot on LLM’s Shoulder: A Summative Assessment of Physical Concept Understanding

LLM Explained | What is LLM