NEWVectors or files. Pick a path.Start →

    Reinforcement Learning Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    215 models available

    Showing 145168 of 215 models

    Reinforcement Learning

    PKU-Alignment/beaver-7b-v3.0-reward

    295
    safe-rlhf
    Reinforcement Learning

    ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16

    292
    91
    transformers
    Reinforcement Learning

    divyasridoredla/traffic-incident-resilient

    292
    stable-baselines3
    Reinforcement Learning

    TianheWu/VisualQuality-R1-7B-preview

    290
    7
    Reinforcement Learning

    manlong/dqn-SpaceInvadersNoFrameskip-v4

    289
    stable-baselines3
    Reinforcement Learning

    mradermacher/sft_14B-GGUF

    288
    1
    transformers
    Reinforcement Learning

    mradermacher/Miner-4B-GGUF

    285
    transformers
    Reinforcement Learning

    mradermacher/SIRI-7B-high-i1-GGUF

    285
    transformers
    Reinforcement Learning

    mradermacher/nexus-1.5b-i1-GGUF

    284
    transformers
    Reinforcement Learning

    JohnRoger/SU-01-Q4_K_M-GGUF

    284
    3
    Reinforcement Learning

    sb3/tqc-FetchPickAndPlace-v1

    282
    2
    stable-baselines3
    Reinforcement Learning

    mradermacher/drkernel-14b-i1-GGUF

    282
    1
    transformers
    Reinforcement Learning

    mradermacher/NurseSim-Triage-Llama-3.2-3B-GGUF

    280
    1
    transformers
    Reinforcement Learning

    Sudhish-Poojary/ppo-LunarLander-v3

    279
    stable-baselines3
    Reinforcement Learning

    SSGoatt/poca-SoccerTwos

    279
    ml-agents
    Reinforcement Learning

    HoaAn2003/ppo-Huggy

    279
    ml-agents
    Reinforcement Learning

    mradermacher/SocialR1-4B-i1-GGUF

    278
    transformers
    Reinforcement Learning

    sb3/a2c-BreakoutNoFrameskip-v4

    276
    2
    stable-baselines3
    Reinforcement Learning

    mradermacher/Orsta-7B-i1-GGUF

    276
    transformers
    Reinforcement Learning

    mradermacher/KnowRL-Nemotron-1.5B-i1-GGUF

    275
    transformers
    Reinforcement Learning

    mradermacher/ATLAS-8B-Thinking-i1-GGUF

    275
    1
    transformers
    Reinforcement Learning

    mradermacher/MetaphorStar-3B-i1-GGUF

    273
    1
    transformers
    Reinforcement Learning

    RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora

    272
    Reinforcement Learning

    mradermacher/Autobool-Qwen4b-Reasoning-conceptual-GGUF

    269
    transformers
    7 / 9