379dl/month
Identifier
Model ID
YuvrajSingh9886/Qwen2.5-0.5B-grpo-summarization-length-onlyTags
mlxsafetensorsqwen2grposummarizationreinforcement-learninglength-penalty-includedendataset:mlabonne/smoltldrbase_model:Qwen/Qwen2.5-0.5B-Instructbase_model:finetune:Qwen/Qwen2.5-0.5B-Instructlicense:apache-2.0region:us
Use Qwen2.5-0.5B-grpo-summarization-length-only on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
OrganizationYuvrajSingh9886
TaskSummarization
Librarymlx
Licenseapache-2.0
Downloads/mo379
View on HuggingFace
See model card, files, and community discussion