NEWVectors or files. Pick a path.Start →

    Reinforcement Learning Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    215 models available

    Showing 4972 of 215 models

    Reinforcement Learning

    mradermacher/MINT-empathy-Qwen3-4B-GGUF

    719
    1
    transformers
    Reinforcement Learning

    mradermacher/Dynamical-30B-A3B-GGUF

    692
    transformers
    Reinforcement Learning

    mradermacher/PRIMO-R1-7B-GGUF

    692
    1
    transformers
    Reinforcement Learning

    ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

    683
    228
    transformers
    Reinforcement Learning

    mradermacher/SpatialThinker-30B-GGUF

    677
    transformers
    Reinforcement Learning

    graceesthi/ug-cppo-finai-2025

    669
    stable-baselines3
    Reinforcement Learning

    mradermacher/Aryabhata-1.0-GGUF

    660
    1
    transformers
    Reinforcement Learning

    mradermacher/Vero-Qwen35-9B-GGUF

    653
    transformers
    Reinforcement Learning

    mradermacher/LongTraceRL-30B-GGUF

    650
    transformers
    Reinforcement Learning

    mradermacher/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF

    643
    transformers
    Reinforcement Learning

    mradermacher/GoLongRL-4B-GGUF

    640
    transformers
    Reinforcement Learning

    mradermacher/LongWriter-Zero-32B-GGUF

    625
    3
    transformers
    Reinforcement Learning

    mradermacher/Spreadsheet-RL-4B-GGUF

    622
    transformers
    Reinforcement Learning

    mradermacher/AReaL-SEA-235B-A22B-GGUF

    611
    transformers
    Reinforcement Learning

    mradermacher/P1-30B-A3B-GGUF

    608
    1
    transformers
    Reinforcement Learning

    infly/inf-query-aligner

    607
    8
    Reinforcement Learning

    igpaub/ppo-CarRacing-v2

    604
    stable-baselines3
    Reinforcement Learning

    mradermacher/LiteResearcher-4B-GGUF

    589
    transformers
    Reinforcement Learning

    mradermacher/DeepHermes-Egregore-8B-131K-i1-GGUF

    572
    1
    transformers
    Reinforcement Learning

    mradermacher/Agent-STAR-RL-7B-i1-GGUF

    567
    1
    transformers
    Reinforcement Learning

    Abc8264/TutorAI-Chemistry-Phi4

    567
    1
    Reinforcement Learning

    ValueFX9507/Tifa-DeepsexV3-14b-GGUF-Q6

    552
    45
    transformers
    Reinforcement Learning

    Tunamelon/ppo-LunarLander-v2

    552
    stable-baselines3
    Reinforcement Learning

    THU-KEG/LLaDA-8B-BGPO-countdown

    551
    1
    3 / 9