    Reinforcement Learning Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    106 models available

    Showing 73-96 of 106 models

    mradermacher/P1-30B-A3B-GGUF

    462
    1
    transformers
    Reinforcement Learning

    sb3/ppo-CartPole-v1

    437
    stable-baselines3
    Reinforcement Learning

    0sunfire0/poca-SoccerTwos_00

    437
    ml-agents
    Reinforcement Learning

    mradermacher/ProtoCycle-7B-GGUF

    415
    transformers
    Reinforcement Learning

    mradermacher/MINT-empathy-Qwen3-1.7B-GGUF

    410
    transformers
    Reinforcement Learning

    kerenOrr/ppo-LunarLander-v2

    395
    stable-baselines3
    Reinforcement Learning

    mradermacher/Agent-STAR-RL-7B-GGUF

    389
    1
    transformers
    Reinforcement Learning

    Sudhish-Poojary/ppo-LunarLander-v3

    378
    stable-baselines3
    Reinforcement Learning

    mradermacher/ReForm-SFT-3B-i1-GGUF

    375
    transformers
    Reinforcement Learning

    ValueFX9507/Tifa-Deepsex-14b-CoT-Q8

    361
    186
    transformers
    Reinforcement Learning

    mradermacher/Agent-STAR-RL-1.5B-GGUF

    330
    transformers
    Reinforcement Learning

    mradermacher/ATLAS-8B-Thinking-i1-GGUF

    328
    1
    transformers
    Reinforcement Learning

    voyzan/poca-SoccerTwos

    322
    ml-agents
    Reinforcement Learning

    mradermacher/SIRI-7B-high-i1-GGUF

    318
    transformers
    Reinforcement Learning

    edbeeching/decision-transformer-gym-hopper-expert

    310
    19
    transformers
    Reinforcement Learning

    ValueFX9507/Tifa-Deepsex-14b-CoT

    294
    224
    transformers
    Reinforcement Learning

    divyasridoredla/traffic-incident-resilient

    291
    stable-baselines3
    Reinforcement Learning

    infly/inf-query-aligner

    285
    8
    Reinforcement Learning

    sb3/tqc-FetchPickAndPlace-v1

    282
    2
    stable-baselines3
    Reinforcement Learning

    SSGoatt/poca-SoccerTwos

    279
    ml-agents
    Reinforcement Learning

    AllIllusion/LunarLander-v3

    266
    stable-baselines3
    Reinforcement Learning

    mradermacher/Agent-STAR-RL-3B-GGUF

    262
    transformers
    Reinforcement Learning

    HoaAn2003/ppo-Huggy

    262
    ml-agents
    Reinforcement Learning

    XunmeiLiu/VFIG-4B

    259
    5
    transformers
    Page 4 of 5