Document Search System

Directory PDFs TXT Files Audio Files Check Notes
An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders PDF 2408.16032v1.txt AI_2408.16032v1_chinese.txt AI_2408.16032v1_english.txt AI_2408.16032v1_thai.txt
AI_2408.16032v1_chinese.wav
AI_2408.16032v1_english.wav
AI_2408.16032v1_thai.wav
DAPO: An Open-Source LLM Reinforcement Learning System at Scale PDF 2503.14476v1.txt AI_2503.14476v1_chinese.txt AI_2503.14476v1_english.txt AI_2503.14476v1_thai.txt
AI_2503.14476v1_chinese.wav
AI_2503.14476v1_english.wav
AI_2503.14476v1_thai.wav
DeepSeek_R1 PDF AI_DeepSeek_R1_chinese.txt AI_DeepSeek_R1_english.txt AI_DeepSeek_R1_thai.txt DeepSeek_R1.txt
AI_DeepSeek_R1_chinese.wav
AI_DeepSeek_R1_english.wav
AI_DeepSeek_R1_thai.wav
Evaluating_Large_Language_Models_Trained_on_Code PDF AI_Evaluating_Large_Language_Models_Trained_on_Code_chinese.txt AI_Evaluating_Large_Language_Models_Trained_on_Code_english.txt AI_Evaluating_Large_Language_Models_Trained_on_Code_thai.txt Evaluating_Large_Language_Models_Trained_on_Code.txt
AI_Evaluating_Large_Language_Models_Trained_on_Code_chinese.wav
AI_Evaluating_Large_Language_Models_Trained_on_Code_english.wav
AI_Evaluating_Large_Language_Models_Trained_on_Code_thai.wav
Framework for LLM applications in manufacturing PDF 1-s2.0-S2213846324000920-main.txt AI_1-s2.0-S2213846324000920-main_chinese.txt AI_1-s2.0-S2213846324000920-main_english.txt AI_1-s2.0-S2213846324000920-main_thai.txt
AI_1-s2.0-S2213846324000920-main_chinese.wav
AI_1-s2.0-S2213846324000920-main_english.wav
AI_1-s2.0-S2213846324000920-main_thai.wav
Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production PDF AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_chinese.txt AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_english.txt AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_thai.txt Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production.txt
AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_chinese.wav
AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_english.wav
AI_Integration_of_Deep_Reinforcement_Learning_and_Discrete-Event_Simulation_for_Real-Time_Scheduling_of_a_Flexible_Job_Shop_Production_thai.wav
Introduction to Reinforcement Learning PDF 2408.07712v3.txt AI_2408.07712v3_chinese.txt AI_2408.07712v3_english.txt AI_2408.07712v3_thai.txt
AI_2408.07712v3_chinese.wav
AI_2408.07712v3_english.wav
AI_2408.07712v3_thai.wav
Knowledge sharing in manufacturing using LLM-powered tools User study and model benchmarking PDF AI_frai-07-1293084_chinese.txt AI_frai-07-1293084_english.txt AI_frai-07-1293084_thai.txt frai-07-1293084.txt
AI_frai-07-1293084_chinese.wav
AI_frai-07-1293084_english.wav
AI_frai-07-1293084_thai.wav
LLM-based Multi-Agent Reinforcement Learning Current and Future Directions PDF 2405.11106v1.txt AI_2405.11106v1_chinese.txt AI_2405.11106v1_english.txt AI_2405.11106v1_thai.txt
AI_2405.11106v1_chinese.wav
AI_2405.11106v1_english.wav
AI_2405.11106v1_thai.wav
Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm PDF AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_chinese.txt AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_english.txt AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_thai.txt Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm.txt
AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_chinese.wav
AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_english.wav
AI_Mastering_Chess_and_Shogi_by_Self-Play_with_a_General_Reinforcement_Learning_Algorithm_thai.wav
Reinforcement Learning A Friendly1 PDF AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_chinese.txt AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_english.txt AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_thai.txt Daoun2022_Chapter_ReinforcementLearningAFriendly1.txt
AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_chinese.wav
AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_english.wav
AI_Daoun2022_Chapter_ReinforcementLearningAFriendly1_thai.wav
Reinforcement_Learning_Advancements_Limitations_an PDF AI_Reinforcement_Learning_Advancements_Limitations_an_chinese.txt AI_Reinforcement_Learning_Advancements_Limitations_an_english.txt AI_Reinforcement_Learning_Advancements_Limitations_an_thai.txt Reinforcement_Learning_Advancements_Limitations_an.txt
AI_Reinforcement_Learning_Advancements_Limitations_an_chinese.wav
AI_Reinforcement_Learning_Advancements_Limitations_an_english.wav
AI_Reinforcement_Learning_Advancements_Limitations_an_thai.wav
Reinforcement_Learning_Enhanced_LLMs_A_Survey PDF - -
TeaMs-RL Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning PDF 2403.08694v4.txt AI_2403.08694v4_chinese.txt AI_2403.08694v4_english.txt AI_2403.08694v4_thai.txt
AI_2403.08694v4_chinese.wav
AI_2403.08694v4_english.wav
AI_2403.08694v4_thai.wav
bitcoin PDF AI_bitcoin_chinese.txt AI_bitcoin_english.txt AI_bitcoin_thai.txt bitcoin.txt
AI_bitcoin_chinese.wav
AI_bitcoin_english.wav
AI_bitcoin_thai.wav