Kevin King PRO
NeoCodes-dev
·
AI & ML interests
Deep RL, RL for LLMs
Recent Activity
posted
an
update
1 day ago
Hi all,
I am working on a project for the Pytorch/HF OpenEnv challenge, and part of the challenge is that a participant/team needs to write a blog post (an Article) on HuggingFace about their submission. However, when I try to create an Article, it says I need a "pro" account.. but I already am a HF Pro member and have been for almost a year!
Is anyone else having this issue? I have a bunch of ideas for blog posts/articles so I'd really like to be able to access this feature, even outside of the OpenEnv Challenge. Can someone let me know if there's a way to fix this or something I'm missing?
Thanks,
Neo
updated
a collection
2 days ago
LLMs
upvoted
an
article
3 days ago
OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments
Organizations
Benchmarks
Datasets - Agents
Datasets - Coding
ARC-AGI2
VLMs - Robotics
Embedding Models
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 113 • 5 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 7 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 38 • 2 -
Sleeping13
CrewAI Gradio Support Agent
👁13Build support agent with CrewAI multi-agents and Gradio
Datasets - CryptoSage
VLMs
Agents
Classifier Models
LLMs
Datasets - Pretraining
OCR/Document Processing
ActionLanguageModels
Datasets - MultiModal
Agent-Specific/Function-Calling Models
Datasets - Robotics
MMMs
Models - CryptoSage
Datasets - Reasoning
Spaces
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 86
DataSets
Pokemon_Red_Experiments
Datasets - Pretraining
Benchmarks
OCR/Document Processing
Datasets - Agents
ActionLanguageModels
Datasets - Coding
Datasets - MultiModal
ARC-AGI2
Agent-Specific/Function-Calling Models
VLMs - Robotics
Datasets - Robotics
Embedding Models
MMMs
ICON - Help Agent
-
Console-AI/IT-helpdesk-synthetic-tickets
Viewer • Updated • 500 • 113 • 5 -
aakash0017/it-support-llm
Viewer • Updated • 1.92k • 7 • 3 -
elsonj/IT-Support-Finetuned-DeepSeek-BitWitDataset
Viewer • Updated • 521 • 38 • 2 -
Sleeping13
CrewAI Gradio Support Agent
👁13Build support agent with CrewAI multi-agents and Gradio
Models - CryptoSage
Datasets - CryptoSage
Datasets - Reasoning
VLMs
Spaces
Agents
Research Papers
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Paper • 2502.15425 • Published • 9 -
EgoLife: Towards Egocentric Life Assistant
Paper • 2503.03803 • Published • 46 -
Visual-RFT: Visual Reinforcement Fine-Tuning
Paper • 2503.01785 • Published • 86
Classifier Models
DataSets
LLMs