Xunan Dai

FreshmanD

https://github.com/baidu-baige/LoongFlow

AI & ML interests

None yet

Recent Activity

posted an update 25 minutes ago

reacted to their post with 🔥 5 days ago

Hello everyone,

We are thrilled to announce that LoongFlow has officially launched General Agent! This iteration introduces three major features, bringing the capabilities of intelligent agents to new heights.

1. Claude Agent SDK Deep Integration
* Integrated with the Claude Agent SDK, enhancing the framework’s extensibility
* Seamless integration with the Claude Skills ecosystem, sharing powerful tool capabilities

2. Breakthrough Multi-File System Support
* Say goodbye to single-file limitations: supports complex system development at a full project level

3. Support for AI Self-Evaluation Mode
* Self-evaluation: agents can assess the quality of their own solutions, saving you the hassle of building evaluation functions

For more details, check out: https://github.com/baidu-baige/LoongFlow/tree/main/agents/general_agent

Feel free to try it out, and let us know if you have any questions or feedback!

Organizations

Posts 5

LoongFlow Big News!!! @all

We’ve put AI Agents into a production GPU cluster to handle GPU failure prediction.

Not as a demo. Not as AutoML.
But as an evolving system that designs and improves its own models.

On two GPU types:
– IT21HMDB01-B2: +30% prediction accuracy
– H800: +25% prediction accuracy

The resulting models already meet production standards and are being wired into the ops pipeline.

How it works:
• An ML agent designs the full ML pipeline from scratch
• A Math agent performs targeted evolutionary optimization
• The agents explore, discard, and iterate toward better models

Humans don’t hand-tune parameters.
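To make the explore/discard/iterate idea concrete, here is a minimal sketch of a generic evolutionary search loop. This is not LoongFlow's code or API: `evolve`, `toy_fitness`, `toy_init`, `toy_mutate`, and the `threshold`/`depth` parameters are all hypothetical names made up for illustration, and the toy fitness function merely stands in for a real model-quality score.

```python
import random


def evolve(fitness, init, mutate, generations=50, population_size=8, seed=0):
    """Generic elitist evolutionary search (illustrative, not LoongFlow's API)."""
    rng = random.Random(seed)
    population = [init(rng) for _ in range(population_size)]
    for _ in range(generations):
        # Explore: every current candidate proposes one mutated child.
        children = [mutate(candidate, rng) for candidate in population]
        # Discard: rank parents and children together, keep only the best.
        ranked = sorted(population + children, key=fitness, reverse=True)
        population = ranked[:population_size]
    return population[0]


# Hypothetical stand-in for a model-quality score; higher is better.
def toy_fitness(cfg):
    return -((cfg["threshold"] - 0.7) ** 2) - 0.01 * (cfg["depth"] - 5) ** 2


def toy_init(rng):
    return {"threshold": rng.random(), "depth": rng.randint(1, 10)}


def toy_mutate(cfg, rng):
    # Small random perturbation, clamped to the valid range.
    return {
        "threshold": min(1.0, max(0.0, cfg["threshold"] + rng.gauss(0, 0.05))),
        "depth": min(10, max(1, cfg["depth"] + rng.choice([-1, 0, 1]))),
    }


best = evolve(toy_fitness, toy_init, toy_mutate)
print(best, toy_fitness(best))
```

Because parents compete with their children for survival, the best score never gets worse from one generation to the next. In a real system the fitness call would be a full train-and-validate run, which is where most of the cost sits.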

This is not offline analysis. GPU failure prediction means:
• heavy assets
• real incidents
• real operational risk
The agents now trigger maintenance before failures happen.

This feels like an early signal: AI agents are starting to take responsibility for infrastructure-level engineering decisions in production systems.

For the ML Agent, see: https://github.com/baidu-baige/LoongFlow

Articles 2

LoongFlow: An Open-Sourced Agent Framework That Transforms Expert Experience into Autonomous AI Productivity

Papers 1

models 0

datasets 0
