
    Reinforcement Learning Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    215 models available

    Showing 193–215 of 215 models

    Reinforcement Learning

    mradermacher/Pluto-GGUF

    240
    1
    transformers
    Reinforcement Learning

    sb3/ppo-CarRacing-v0

    240
    stable-baselines3
    Reinforcement Learning

    mradermacher/CIM-Qwen2-VL-7B-GGUF

    238
    transformers
    Reinforcement Learning

    wanglab/bioreason-pro-rl

    236
    7
    Reinforcement Learning

    mradermacher/IntelliAsk-Qwen3-32B-450-Merged-GGUF

    232
    transformers
    Reinforcement Learning

    mradermacher/CIM-Qwen2-VL-7B-SFT-GGUF

    227
    1
    transformers
    Reinforcement Learning

    mradermacher/Vero-Qwen3T-8B-i1-GGUF

    223
    transformers
    Reinforcement Learning

    Raiden-1001/poca-Soccerv7

    222
    ml-agents
    Reinforcement Learning

    mradermacher/PRIMO-COT-SFT-7B-GGUF

    221
    2
    transformers
    Reinforcement Learning

    mradermacher/R-PRM-7B-DPO-i1-GGUF

    221
    transformers
    Reinforcement Learning

    mradermacher/Vero-Qwen25-7B-i1-GGUF

    220
    transformers
    Reinforcement Learning

    mradermacher/HER-32B-i1-GGUF

    220
    transformers
    Reinforcement Learning

    RLinf/RLinf-OpenVLAOFT-LIBERO-130

    217
    3
    Reinforcement Learning

    sb3/sac-HalfCheetah-v3

    216
    2
    stable-baselines3
    Reinforcement Learning

    mradermacher/Miner-8B-i1-GGUF

    212
    transformers
    Reinforcement Learning

    cjksofm/ppo-LunarLander-v3

    210
    stable-baselines3
    Reinforcement Learning

    mradermacher/PulseMind-72B-i1-GGUF

    204
    2
    transformers
    Reinforcement Learning

    mradermacher/Tifa-DeepsexV2-7b-MGRPO-safetensors-GGUF

    204
    1
    transformers
    Reinforcement Learning

    mradermacher/ToolOmni-Qwen3-4B-i1-GGUF

    203
    1
    transformers
    Reinforcement Learning

    tarmus/hw3-rl-models

    201
    stable-baselines3
    Reinforcement Learning

    werdunkel/losingit

    200
    stable-baselines3
    Reinforcement Learning

    Srgreen/ppo-LunarLander-v3

    198
    stable-baselines3
    Reinforcement Learning

    mradermacher/Miner-4B-i1-GGUF

    191
    transformers