NEWVectors or files. Pick a path.Start →

    Reinforcement Learning Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    215 models available

    Showing 169192 of 215 models

    Reinforcement Learning

    mradermacher/Tifa-Deepsex-14b-CoT-GGUF

    267
    23
    transformers
    Reinforcement Learning

    sb3/dqn-CartPole-v1

    267
    stable-baselines3
    Reinforcement Learning

    ValueFX9507/Tifa-Deepsex-14b-CoT

    265
    224
    transformers
    Reinforcement Learning

    realambuj2001/schemaquake1-lora

    265
    transformers
    Reinforcement Learning

    XunmeiLiu/VFIG-4B

    263
    5
    transformers
    Reinforcement Learning

    dlantonia/ppo-FrozenLake-v1

    263
    stable-baselines3
    Reinforcement Learning

    mradermacher/Agent-STAR-RL-3B-GGUF

    262
    transformers
    Reinforcement Learning

    MoHan136/ppo-Huggy-class

    261
    ml-agents
    Reinforcement Learning

    DocPereira/PEAL_V4_LHP_Zero_Entropy_Controlled

    260
    1
    Reinforcement Learning

    sb3/ppo-LunarLander-v2

    260
    stable-baselines3
    Reinforcement Learning

    mradermacher/Qwen3-0.6B-ReMax-GGUF

    256
    transformers
    Reinforcement Learning

    mradermacher/SEOcrate-4B_grpo_new_01-i1-GGUF

    252
    transformers
    Reinforcement Learning

    voyzan/poca-SoccerTwos

    251
    ml-agents
    Reinforcement Learning

    mradermacher/Tifa-DeepsexV2-7b-MGRPO-safetensors-i1-GGUF

    249
    transformers
    Reinforcement Learning

    mradermacher/Vero-Qwen3I-8B-i1-GGUF

    247
    transformers
    Reinforcement Learning

    imran785/medical-triage-qwen-3b-trained

    245
    1
    transformers
    Reinforcement Learning

    mradermacher/inframind-0.5b-dapo-GGUF

    244
    transformers
    Reinforcement Learning

    mradermacher/eubiota-planner-8b-i1-GGUF

    244
    transformers
    Reinforcement Learning

    Snowflake/Arctic-AWM-4B

    243
    8
    Reinforcement Learning

    mradermacher/ATLAS-8B-Thinking-GGUF

    243
    2
    transformers
    Reinforcement Learning

    reebop/ppo-LunarLander-v3

    243
    stable-baselines3
    Reinforcement Learning

    sb3/dqn-BreakoutNoFrameskip-v4

    242
    2
    stable-baselines3
    Reinforcement Learning

    mradermacher/story_generation_Qwen3_8B_RL-i1-GGUF

    242
    transformers
    Reinforcement Learning

    SamuelM0422/ppo-SolarTracker

    241
    ml-agents
    8 / 9