Interesting SSL papers
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision • Paper • 2311.02077 • Published Nov 3, 2023 • 15
System 2 Attention (is something you might need too) • Paper • 2311.11829 • Published Nov 20, 2023 • 43
Large Language Models for Mathematicians • Paper • 2312.04556 • Published Dec 7, 2023 • 12
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks • Paper • 2403.00522 • Published Mar 1, 2024 • 46
LLM
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling • Paper • 2412.05271 • Published Dec 6, 2024 • 160
The Ultra-Scale Playbook • Running • 3.69k • The ultimate guide to training LLM on large GPU Clusters