Using GPUs in Cloud Run

Tác giả: Google Cloud Tech
Ngày xuất bản: 2024-10-03T00:00:00
Length: 08:44

Sign up for the preview → https://goo.gle/3NnobXv

GPU best practices → https://goo.gle/4elRpBE

Run LLM inference on Cloud Run GPUs with Ollama → https://goo.gle/3BwN6F1

Cloud Run, known for its scalability, now supports GPUs, opening it up to machine learning inference workloads. Join Googlers Martin Omander and Wietse Venema as they give a practical demonstration of deploying Google's Gemma 2, an open large language model, with Ollama on Cloud Run.
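The deployment shown in the video can be sketched as a single `gcloud` command. This is a hedged sketch, not the exact command from the demo: the service name and region are hypothetical, and the GPU flags (`--gpu`, `--gpu-type`) belong to the Cloud Run GPU preview, so check the current documentation before running it.

```shell
# Sketch of deploying an Ollama container to Cloud Run with one NVIDIA L4 GPU.
# Service name "ollama-gemma" and region are placeholders, not from the video.
gcloud beta run deploy ollama-gemma \
  --image=ollama/ollama \
  --region=us-central1 \
  --gpu=1 \
  --gpu-type=nvidia-l4 \
  --cpu=4 \
  --memory=16Gi \
  --no-cpu-throttling \
  --max-instances=1
```

The larger CPU and memory settings reflect that GPU-attached Cloud Run services require more resources than a typical serverless container; `--max-instances` caps scaling while experimenting.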

Chapters:

0:00 - Intro

0:22 - Google Vertex AI vs GPUs with Cloud Run

1:12 - AI app architecture

2:04 - [Demo] Deploying Ollama API

3:26 - [Demo] Testing the deployment

5:28 - [Demo] Build & deploy the front end

6:02 - How do GPUs scale on Cloud Run?

6:34 - Where are Gemma 2 model files stored?

7:12 - Getting started with GPUs in Cloud Run
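For the "Testing the deployment" step, a deployed Ollama service can be exercised over its REST API. The service URL below is a hypothetical placeholder; the `/api/generate` endpoint and its `model`/`prompt`/`stream` fields are Ollama's standard generate API, and the identity token is the usual way to call an authenticated Cloud Run service.

```shell
# Hypothetical Cloud Run URL -- replace with your service's URL.
# Sends one non-streaming prompt to the Gemma 2 model served by Ollama.
curl https://ollama-gemma-example-uc.a.run.app/api/generate \
  -H "Authorization: Bearer $(gcloud auth print-identity-token)" \
  -d '{"model": "gemma2", "prompt": "Why is the sky blue?", "stream": false}'
```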

More Resources:

Cloud Run pricing → https://goo.gle/3BeMhAD

Watch more Serverless Expeditions → https://goo.gle/ServerlessExpeditions

Subscribe to Google Cloud Tech → https://goo.gle/GoogleCloudTech

#ServerlessExpeditions #GoogleCloud

Speakers: Martin Omander, Wietse Venema

Products Mentioned: Cloud Run, Gemma
