14 2

Jianan Fan

muyiyunzi

muyiyunzi

AI & ML interests

Autonomous Driving, planning and control

Recent Activity

authored a paper 14 days ago

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

authored a paper 14 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

upvoted a paper 15 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

View all activity

Organizations

None yet

authored 2 papers 14 days ago

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Paper • 2312.09245 • Published Dec 14, 2023

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 16 days ago • 187

upvoted a paper 15 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 16 days ago • 187

upvoted a collection 15 days ago

SenseNova-U1

Collection

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 9 items • Updated about 1 hour ago • 67

upvoted a paper about 1 month ago

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Paper • 2604.15093 • Published Apr 16 • 30

upvoted a paper 2 months ago

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

Paper • 2603.22918 • Published Mar 24 • 44

upvoted an article 3 months ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova

•

Mar 5

• 163

upvoted a paper 5 months ago

SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning

Paper • 2512.24330 • Published Dec 30, 2025 • 36

updated a model 6 months ago

muyiyunzi/pi05_retrain

Robotics • 4B • Updated Nov 28, 2025

published a model 6 months ago

muyiyunzi/pi05_retrain

Robotics • 4B • Updated Nov 28, 2025

upvoted 3 papers 7 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16, 2025 • 70

InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue

Paper • 2510.13747 • Published Oct 15, 2025 • 33

CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving

Paper • 2510.07944 • Published Oct 9, 2025 • 25

updated a dataset 8 months ago

muyiyunzi/so100_double_cam_max6_0917

Viewer • Updated Sep 17, 2025 • 15.4k • 52

published a dataset 8 months ago

muyiyunzi/so100_double_cam_max6_0917

Viewer • Updated Sep 17, 2025 • 15.4k • 52

updated a dataset 8 months ago

muyiyunzi/so100_double_cam_0917

Viewer • Updated Sep 17, 2025 • 15.4k • 150

published a dataset 8 months ago

muyiyunzi/so100_double_cam_0917

Viewer • Updated Sep 17, 2025 • 15.4k • 150

updated a dataset 8 months ago

muyiyunzi/so100_yellow_doublecam_cropped

Viewer • Updated Sep 17, 2025 • 60 • 10

published a dataset 8 months ago

muyiyunzi/so100_yellow_doublecam_cropped

Viewer • Updated Sep 17, 2025 • 60 • 10

updated a dataset 8 months ago

muyiyunzi/so100_blue_doublecam_cropped

Viewer • Updated Sep 16, 2025 • 58 • 12

Jianan Fan

AI & ML interests

Recent Activity

Organizations

muyiyunzi's activity

NEO-unify: Building Native Multimodal Unified Models End to End