Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
thangtm 's Collections
flow_matching_model
reasoning_model
DLM
RL
ARC
RAG
Reduce_thinking
OCR

reasoning_model

updated 1 day ago
Upvote
-

  • OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

    Paper • 2511.16334 • Published 27 days ago • 91

  • Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

    Paper • 2509.07980 • Published Sep 9 • 101

  • ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute

    Paper • 2509.04475 • Published Aug 30 • 3

  • Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

    Paper • 2512.01374 • Published 17 days ago • 90

  • DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

    Paper • 2511.22570 • Published 20 days ago • 76

  • ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

    Paper • 2512.07843 • Published 23 days ago • 19

  • Apriel-1.5-15b-Thinker

    Paper • 2510.01141 • Published Oct 1 • 119

  • SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

    Paper • 2504.11468 • Published Apr 10 • 30

  • Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

    Paper • 2501.09686 • Published Jan 16 • 41

  • OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

    Paper • 2410.09671 • Published Oct 12, 2024 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs