Submitted by
Zhimin Zhao
Papers
Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the Wild
Do AI Coding Agents Log Like Humans? An Empirical Study