NEWVectors or files. Pick a path.Start →

    Reinforcement Learning Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    215 models available

    Showing 121144 of 215 models

    Reinforcement Learning

    sb3/dqn-SeaquestNoFrameskip-v4

    348
    stable-baselines3
    Reinforcement Learning

    mradermacher/Vero-Qwen3I-8B-GGUF

    337
    transformers
    Reinforcement Learning

    mradermacher/ReForm-SFT-1.5B-i1-GGUF

    337
    transformers
    Reinforcement Learning

    mradermacher/PulseMind-72B-GGUF

    336
    transformers
    Reinforcement Learning

    mradermacher/HER-32B-ACL-GGUF

    332
    transformers
    Reinforcement Learning

    mradermacher/InfiGUI-G1-7B-i1-GGUF

    331
    1
    transformers
    Reinforcement Learning

    mradermacher/LiteResearcher-4B-i1-GGUF

    330
    transformers
    Reinforcement Learning

    mradermacher/Vero-MiMo-7B-GGUF

    330
    1
    transformers
    Reinforcement Learning

    mradermacher/Agent-STAR-RL-1.5B-GGUF

    330
    transformers
    Reinforcement Learning

    sb3/ppo-Pendulum-v1

    329
    3
    stable-baselines3
    Reinforcement Learning

    mradermacher/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb-i1-GGUF

    327
    4
    transformers
    Reinforcement Learning

    mradermacher/Metis-8B-RL-GGUF

    319
    1
    transformers
    Reinforcement Learning

    0sunfire0/poca-SoccerTwos_00

    319
    ml-agents
    Reinforcement Learning

    mradermacher/Miner-8B-GGUF

    317
    transformers
    Reinforcement Learning

    mradermacher/arc-teacher-8b-i1-GGUF

    317
    1
    transformers
    Reinforcement Learning

    ccnets/causal-gpt-rl

    316
    2
    pytorch
    Reinforcement Learning

    pat-jj/s3-8-3-3-20steps

    313
    transformers
    Reinforcement Learning

    mradermacher/Agent-STAR-RL-7B-GGUF

    309
    1
    transformers
    Reinforcement Learning

    edbeeching/decision-transformer-gym-hopper-expert

    299
    20
    transformers
    Reinforcement Learning

    mradermacher/AutoGEO_mini_Qwen1.7B-i1-GGUF

    297
    transformers
    Reinforcement Learning

    mradermacher/Orsta-7B-GGUF

    297
    transformers
    Reinforcement Learning

    antof27/RL-course

    297
    stable-baselines3
    Reinforcement Learning

    mradermacher/StarPO-4B-i1-GGUF

    297
    1
    transformers
    Reinforcement Learning

    Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B

    296
    transformers
    6 / 9