Về trang chủ

SWE-RL của Meta — Học tăng cường cho LLM Kỹ thuật Phần mềm

Nguồn: https://www.youtube.com/watch?v=DimD-eX0DW8

Tác giả: AI Papers Academy

Ngày xuất bản: 2025-02-28T00:00:00

Length: 08:31

Tóm tắt nội dung

Mô tả

In this video, we dive into a new Meta research paper: "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution".

This paper introduces SWE-RL, a new reinforcement learning method for real-world software engineering. By training large language models (LLMs) directly on the evolution of real GitHub projects, SWE-RL can empower LLM to be better at software engineering.

We break down:

• How Meta curated 11 million pull requests from GitHub.

• SWE-RL training pipeline.

• SWE-RL state-of-the-art results on SWE-bench Verified for open-source models under 100B parameters.

🔗 Written Review: https://aipapersacademy.com/swe-rl/

🔗 Paper Link: https://arxiv.org/abs/2502.18449

🔗 Code: https://github.com/facebookresearch/swe-rl

___________________

🔔 Subscribe for more AI paper reviews!

📩 Join the newsletter → https://aipapersacademy.com/newsletter/

Become a patron - https://www.patreon.com/aipapersacademy

The video was edited using VideoScribe - https://tidd.ly/44TZEiX

___________________

#airesearch #metaai #swe_rl #reinforcementlearning #llm

Chapters:

0:00 Introduction

1:15 GitHub PRs Curation

3:20 SWE-RL Training

5:42 Aha Moments

6:39 SWE-RL Results

Dịch Vào Lúc: 2025-03-02T03:21:43Z

Video Đề Xuất

34:54

Google Ai Mode - Nguồn Lợi Nhuận Tiềm Ẩn - Kiếm Tiền Với Google :) - affiliatemarketingmc

11:22

AI và Học Ngoại Ngữ… Học Ngoại Ngữ Có Còn Hợp Lý Nữa Không? - Dustin Schermaul

13:23

Xử lý luồng là gì? + Khi nào chúng ta nên sử dụng nó? | Phỏng vấn Thiết kế Hệ thống từ 0 đến 1 với cựu Kỹ sư phần mềm Google - Jordan has no life

12:05

Đồ thị tri thức - Computerphile - Computerphile

14:47

[CoA] TÓM TẮT THÔNG TIN BẢN CẬP NHẬT NGÀY 25 THÁNG 6 - Nellan

12:21

Carlos Ghosn nói về các cuộc đàm phán Nissan-Honda, xe điện Trung Quốc, Trump - Bloomberg Television

13:01

Lec-0: Thảo luận về Giáo trình Hệ điều hành cho tất cả các kỳ thi Cao đẳng/Đại học & Cạnh tranh (GATE, NET) - Gate Smashers

20:00

Thiết kế Màn hình Đăng nhập Hiện đại Android với Chia sẻ Hoạt ảnh trong Android Studio 2021 - Coding With T

11:05

“Người đàn ông hoàn hảo” - 5 đặc điểm mà hầu hết phụ nữ thấy cực kỳ hấp dẫn - Charisma on Command

07:44

Em bé nhận phương pháp điều trị chỉnh sửa gen cá nhân hóa đầu tiên trên thế giới | DW News - DW News

12:49

Phần 2 - Thiết lập MCP Client trong 10 phút - Jack Herrington

18:55

Kỹ thuật Điên Rồ của SR-71 Blackbird - Real Engineering

vie v3 kor v3 jap v3