Chúng tôi không thể tìm thấy kết nối internet
Đang cố gắng kết nối lại
Có lỗi xảy ra!
Hãy kiên nhẫn trong khi chúng tôi khắc phục sự cố
Qwen3 với Hiệu suất VƯỢT TRỘI. Nó ĐÁNH BẠI DeepSeek R1!
# Qwen 3: Revolutionary Open-Weight AI Model Outperforms Leading Competitors!
In this video, we explore Qwen 3, the groundbreaking open-weight AI model that outperforms OpenAI O1, OpenAI O3 Mini, and even Deepseek R1! Released under the Apache 2.0 license for commercial use, Qwen 3 represents a significant advancement in AI technology.
## Model Variants:
• 8 total models (6 dense models, 2 mixture of experts)
• Size range from 6B to 32B parameters
• MOE models: 30B (3B active) and 235B (22B active)
## Key Features:
• Hybrid Thinking: Toggle between "thinking mode" for complex reasoning and "non-thinking mode" for quick responses
• Stable Thinking Budget Control: Efficiently manage computational resources
• Multilingual Support: Handles 119 different languages
• Significantly improved accuracy with thinking mode enabled
## Training Innovations:
• 2x larger dataset compared to Qwen 2.5
• Diverse data sources: web, PDF documents, and synthetic data
• Three-stage training process focusing on:
1) Basic language skills
2) Coding and reasoning
3) High-quality long context understanding
## Post-Training Refinements:
• Chain-of-thought reasoning
• Reasoning reinforcement learning
• Thinking mode fusion
• General reinforcement learning
• Knowledge distillation for smaller models
## How to Use:
• Available through Hugging Face Transformers, SGLang, and VLM
• Enable thinking mode with "/think" (default mode)
• Disable thinking with "/nothink"
• Excellent tool-calling and agentic capabilities
• MCP support for enhanced functionality
## Try It Yourself:
Qwen 3 is available on Hugging Face and via Qwen Chat with both MOE and smaller versions. Code available on GitHub!
What do you think about Qwen 3? Let me know in the comments below!
If you enjoyed this video, check out my other video about OpenAI's O3 model!
#AI #MachineLearning #LanguageModels #Qwen3 #AITechnology #OpenSource #ArtificialIntelligence #NLP #DeepLearning
Timestamp:
0:00 - Introduction to Qwen3
0:22 - Model Variants & Sizes
0:49 - Key Features
1:35 - Training Process
2:32 - How to Use Qwen3
2:59 - Tool Calling & Availability
3:20 - Demo of Qwen 3
3:54 - Testing Examples
4:32 - Conclusion
Dịch Vào Lúc: 2025-04-29T00:12:34Z