Zixian Ma is a Ph.D. student in Computer Science at the University of Washington, Seattle. Before that, she was an undergraduate student in Computer Science at Stanford University. Zixian Ma's research interests are multi-modal models and human-AI interaction.
We present TACO, a family of multi-modal large action models designed to improve performance on complex questions that require multiple capabilities and demand multi-step solutions.