| Directory | PDFs | TXT Files | Audio Files | Check | Notes |
|---|---|---|---|---|---|
| An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders | | 2408.16032v1.txt AI_2408.16032v1_chinese.txt AI_2408.16032v1_english.txt AI_2408.16032v1_thai.txt | AI_2408.16032v1_chinese.wav<br>AI_2408.16032v1_english.wav<br>AI_2408.16032v1_thai.wav | ◻ | |
| DAPO: An Open-Source LLM Reinforcement Learning System at Scale | | 2503.14476v1.txt AI_2503.14476v1_chinese.txt AI_2503.14476v1_english.txt AI_2503.14476v1_thai.txt | AI_2503.14476v1_chinese.wav<br>AI_2503.14476v1_english.wav<br>AI_2503.14476v1_thai.wav | ◻ | |
| DeepSeek_R1 | | AI_DeepSeek_R1_chinese.txt AI_DeepSeek_R1_english.txt AI_DeepSeek_R1_thai.txt DeepSeek_R1.txt | AI_DeepSeek_R1_chinese.wav<br>AI_DeepSeek_R1_english.wav<br>AI_DeepSeek_R1_thai.wav | ◻ | |
| Evaluating_Large_Language_Models_Trained_on_Code | | AI_Evaluating_Large_Language_Models_Trained_on_Code_chinese.txt AI_Evaluating_Large_Language_Models_Trained_on_Code_english.txt AI_Evaluating_Large_Language_Models_Trained_on_Code_thai.txt Evaluating_Large_Language_Models_Trained_on_Code.txt | AI_Evaluating_Large_Language_Models_Trained_on_Code_chinese.wav<br>AI_Evaluating_Large_Language_Models_Trained_on_Code_english.wav<br>AI_Evaluating_Large_Language_Models_Trained_on_Code_thai.wav | ◻ | |
| Framework for LLM applications in manufacturing | | 1-s2.0-S2213846324000920-main.txt AI_1-s2.0-S2213846324000920-main_chinese.txt AI_1-s2.0-S2213846324000920-main_english.txt AI_1-s2.0-S2213846324000920-main_thai.txt | AI_1-s2.0-S2213846324000920-main_chinese.wav<br>AI_1-s2.0-S2213846324000920-main_english.wav<br>AI_1-s2.0-S2213846324000920-main_thai.wav | ◻ | |
| Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production | | AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_chinese.txt AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_english.txt AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_thai.txt Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production.txt | AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_chinese.wav<br>AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_english.wav<br>AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_thai.wav | ◻ | |
| Introduction to Reinforcement Learning | | 2408.07712v3.txt AI_2408.07712v3_chinese.txt AI_2408.07712v3_english.txt AI_2408.07712v3_thai.txt | AI_2408.07712v3_chinese.wav<br>AI_2408.07712v3_english.wav<br>AI_2408.07712v3_thai.wav | ◻ | |
| Knowledge sharing in manufacturing using LLM-powered tools User study and model benchmarking | | AI_frai-07-1293084_chinese.txt AI_frai-07-1293084_english.txt AI_frai-07-1293084_thai.txt frai-07-1293084.txt | AI_frai-07-1293084_chinese.wav<br>AI_frai-07-1293084_english.wav<br>AI_frai-07-1293084_thai.wav | ◻ | |
| LLM-based Multi-Agent Reinforcement Learning Current and Future Directions | | 2405.11106v1.txt AI_2405.11106v1_chinese.txt AI_2405.11106v1_english.txt AI_2405.11106v1_thai.txt | AI_2405.11106v1_chinese.wav<br>AI_2405.11106v1_english.wav<br>AI_2405.11106v1_thai.wav | ◻ | |
| Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm | | AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_chinese.txt AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_english.txt AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_thai.txt Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm.txt | AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_chinese.wav<br>AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_english.wav<br>AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_thai.wav | ◻ | |
| Reinforcement Learning A Friendly1 | | AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_chinese.txt AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_english.txt AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_thai.txt Daoun2022_Chapter_ReinforcementLearningAFriendly1.txt | AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_chinese.wav<br>AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_english.wav<br>AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_thai.wav | ◻ | |
| Reinforcement_Learning_Advancements_Limitations_an | | AI_Reinforcement_Learning_Advancements_Limitations_an_chinese.txt AI_Reinforcement_Learning_Advancements_Limitations_an_english.txt AI_Reinforcement_Learning_Advancements_Limitations_an_thai.txt Reinforcement_Learning_Advancements_Limitations_an.txt | AI_Reinforcement_Learning_Advancements_Limitations_an_chinese.wav<br>AI_Reinforcement_Learning_Advancements_Limitations_an_english.wav<br>AI_Reinforcement_Learning_Advancements_Limitations_an_thai.wav | ◻ | |
| Reinforcement_Learning_Enhanced_LLMs_A_Survey | | - | - | ◻ | |
| TeaMs-RL Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning | | 2403.08694v4.txt AI_2403.08694v4_chinese.txt AI_2403.08694v4_english.txt AI_2403.08694v4_thai.txt | AI_2403.08694v4_chinese.wav<br>AI_2403.08694v4_english.wav<br>AI_2403.08694v4_thai.wav | ◻ | |
| bitcoin | | AI_bitcoin_chinese.txt AI_bitcoin_english.txt AI_bitcoin_thai.txt bitcoin.txt | AI_bitcoin_chinese.wav<br>AI_bitcoin_english.wav<br>AI_bitcoin_thai.wav | ✓ | |