NEWAgents can now see video via MCP.Try it now →

    Reinforcement Learning Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    106 models available

    Showing 124 of 106 models

    Reinforcement Learning

    HumanCompatibleAI/ppo-seals-CartPole-v0

    58K
    16
    stable-baselines3
    Reinforcement Learning

    HumanCompatibleAI/ppo-Pendulum-v1

    53K
    5
    stable-baselines3
    Reinforcement Learning

    TianheWu/VisualQuality-R1-7B

    51K
    11
    Reinforcement Learning

    sb3/sac-BipedalWalkerHardcore-v3

    18K
    stable-baselines3
    Reinforcement Learning

    mradermacher/AReaL-SEA-235B-A22B-i1-GGUF

    15K
    transformers
    Reinforcement Learning

    mradermacher/Miner-8B-i1-GGUF

    9K
    transformers
    Reinforcement Learning

    ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8

    9K
    202
    transformers
    Reinforcement Learning

    infly/inf-retriever-v1-pro

    7K
    6
    Reinforcement Learning

    mradermacher/Vero-Qwen25-7B-i1-GGUF

    5K
    transformers
    Reinforcement Learning

    mradermacher/Vero-Qwen3I-8B-i1-GGUF

    5K
    transformers
    Reinforcement Learning

    mradermacher/Vero-MiMo-7B-i1-GGUF

    5K
    2
    transformers
    Reinforcement Learning

    mradermacher/HER-32B-i1-GGUF

    5K
    transformers
    Reinforcement Learning

    mradermacher/Pluto-i1-GGUF

    4K
    transformers
    Reinforcement Learning

    mradermacher/Miner-4B-i1-GGUF

    3K
    transformers
    Reinforcement Learning

    Open-Reasoner-Zero/Open-Reasoner-Zero-7B

    3K
    33
    transformers
    Reinforcement Learning

    nicklashansen/newt

    2K
    2
    Reinforcement Learning

    mradermacher/MediX-R1-2B-i1-GGUF

    2K
    transformers
    Reinforcement Learning

    mradermacher/Vero-Qwen3T-8B-i1-GGUF

    2K
    transformers
    Reinforcement Learning

    mradermacher/GALAX-i1-GGUF

    2K
    transformers
    Reinforcement Learning

    mradermacher/Miner-8B-GGUF

    2K
    transformers
    Reinforcement Learning

    edbeeching/decision-transformer-gym-hopper-medium

    2K
    7
    transformers
    Reinforcement Learning

    mradermacher/ToolOmni-Qwen3-4B-i1-GGUF

    2K
    1
    transformers
    Reinforcement Learning

    ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

    2K
    829
    transformers
    Reinforcement Learning

    PKU-Alignment/beaver-7b-v1.0-cost

    2K
    10
    safe-rlhf
    1 / 5