Benjamim Alves Nepomuceno Neto
- 💻 Wan2.1 (Running, Featured, 2k likes): Wan: Open and Advanced Large-Scale Video Generative Models
- 🎥 Wan2.1 Fast (Runtime error, MCP, Featured, 1.6k likes): Generate a video from an image with a prompt
- 🏢 NAG Wan2-1-fast (Runtime error, Featured, 72 likes): Demo of Normalized Attention Guidance for 4 steps Wan2.1
- 🎥 Self Forcing Wan 2.1 (Paused, MCP, Featured, 322 likes): Real-time video generation
- 👀 Mediapipe Face Mesh 3d (Running, 38 likes): create 3d-gltf face-mesh from image with mediapipe
- 👁 Mediapipe Head Pose Estimation (Running, 5 likes): 2 head pose estimation with mediapipe and trained-model
- ⚡ Mediapipe 68 Points Facial Mask (Running, 9 likes): create facial masks from 68 points landmark
- 📸 InfiniteYou-FLUX (Running on Zero, Featured, 1.1k likes): Flexible Photo Recrafting While Preserving Your Identity
- 🤡 MatAnyone (Runtime error, Featured, 205 likes): Gradio demo for MatAnyone
- 📽 Video Background Removal (Running on Zero, Featured, 563 likes): Remove/Change background of video.
- 🐠 SAM3 Video Segmentation (Running on Zero, Featured, 106 likes): Track and label objects in videos using text prompts or clicks
- ⚡ VideoMaMa (Running on Zero, 14 likes): Remove video backgrounds and generate matte videos
- 🧊 Dpt Depth Estimation + 3D Voxels (Build error, 116 likes): Create 3D models from images using depth estimation
- 🌍 Hunyuan3D-2.0 (Running on Zero, 3.21k likes): Text-to-3D and Image-to-3D Generation
- 🏢 TRELLIS (Running on Zero, Featured, 4.78k likes): Scalable and Versatile 3D Generation from images
- 👀 Video Depth Anything (Running on Zero, Featured, 216 likes): Generate depth video from input video
- 👀 Manimator (Running, Featured, 178 likes): Transform research papers and mathematical concepts into stu
- 👀 Gaze Demo (Paused, Featured, 181 likes): Gaze detection using Moondream
- 🎨 Metropolitan Museum (Running, 11 likes): The Metropolitan Museum of Art Collection
- 🚀 CountGD_Multi-Modal_Open-World_Counting (Sleeping, Featured, 117 likes): Count objects in images using text, visual examples, or both
- 🎼 Midi Music Generator (Running on Zero, Featured, 564 likes): Generate MIDI music with custom instruments and settings
- 👩 YuE (Paused, Featured, 202 likes): Generate music from lyrics and genre tags
- 👩 Open SUNO (Paused, 51 likes): Your Lyrics into Complete Songs with Vocals in Multilingual
- 🎶 Di♪♪Rhythm (Running on Zero, Featured, 677 likes): Blazingly Fast and Embarrassingly Simple Song Generation
- 🏃 SD3 Long Captioner (Running on Zero, Featured, 260 likes): Generate detailed captions for images using AI
- 🐨 ChartGemma (Runtime error, Featured, 111 likes): Generate insights from charts using text prompts
- 🖼 AuraFlow-v0.3 with Captioner (Running on Zero, 90 likes): Generate images from prompts or images
- openai/clip-vit-large-patch14: Zero-Shot Image Classification • 0.4B • Updated • 7.95M • 1.96k
- 🧛 Omni-Zero (Runtime error, Featured, 462 likes): Restylize & repose person ID
- 📷 PhotoMaker V2 (Running on Zero, 1.2k likes): Generate personalized portrait images of a specific person
- 🎨 FLUX.1 [Inpainting] (Runtime error, Featured, 642 likes)
- 🐨 Expression Editor (Running on L40S, Featured, 1.61k likes): Quickly edit the expression of a face
- 🔊 MMAudio — generating synchronized audio from video/text (Running on Zero, Featured, 927 likes): Generate audio from video and text prompts
- 🚀 TangoFlux (Running on Zero, 325 likes): Text to Audio (Sound SFX) Generator
- 🔥 Stable Audio Open Zero (Running on Zero, 457 likes): Generate custom audio clips from text prompts
- 👩 YuE (Paused, Featured, 202 likes): Generate music from lyrics and genre tags
- 🌍 PDF Chatbot (Running, 376 likes): Ask questions about PDFs using a chatbot
- ⚡ Video Transcription Smart Summary (Runtime error, Featured, 367 likes): Generate summaries from YouTube videos or uploaded videos
- 🔍 Quantized Retrieval (Running, 136 likes): Efficient quantized retrieval over Wikipedia
- 🍷 FineWeb: decanting the web for the finest text data at scale (Running, Featured, 1.29k likes): Read about FineWeb, a large web-text dataset for LLMs
- 📚 Anime Image Classification (Running, 40 likes): Analyze anime images for various attributes
- 🎨 PaintsUndo (Running on Zero, Featured, 169 likes): Generate key frames and video from one uploaded image
- 🖼 Kolors IP-Adapter (Running on Zero, 159 likes): Generate images using a prompt and reference picture
- 🤗 PuLID-FLUX (Running on Zero, Featured, 2.07k likes): Generate customized images from a prompt and reference photo
- 🖼 Panoptic Segment Anything (Runtime error, Featured, 93 likes)
- 📚 Grounded Segment Anything (Runtime error, Featured, 396 likes)
- 🏢 Inspyrenet Remove Background (Running on Zero, 200 likes): Remove backgrounds or get masks from your images
- 🔥 Florence2 + SAM2 (Runtime error, Featured, 515 likes): Segment and caption objects in images and videos
- 🔊 BigVGAN (Running, Featured, 113 likes): Generate high-quality audio and spectrogram from your clip
- 🎼 Audio Emotion Recognition (Running, 24 likes): Detect emotions from audio recordings
- 📉 SoundwaveDemo (Sleeping, Featured, 61 likes): Process audio and generate text output based on instructions
- 🦀 DiffVox (Running, Featured, 70 likes): Enhance vocals with professional effects using sliders
- MIT/ast-finetuned-audioset-10-10-0.4593: Audio Classification • 86.6M • Updated • 430k • 339
- 🔥 Llasa 3b Tts (Running on Zero, 313 likes): Zero Shot voice cloning with llasa 3b (Unofficial Demo)
- 👩 YuE (Paused, Featured, 202 likes): Generate music from lyrics and genre tags
- 🌍 Zonos (Running on Zero, Featured, 411 likes): Generate natural-sounding speech from text with voice control