Balancing Quality and Efficiency in Future AI Systems
Shiqiang Wang, IBM Research
1-2pm 2nd Dec 2024
Abstract
Quality and efficiency are both essential in AI systems as data sources become more diverse and model sizes grow. In this talk, I will present techniques to address challenges in data and model quality as well as their efficiency, which are essential for building high-performing and sustainable AI systems. I will first introduce a toolkit for enhancing the quality of datasets, which can be used in a broad range of learning tasks including the training or fine tuning large language models (LLMs), laying the groundwork for model training with good data. Then, considering the specific challenge where data is distributed unevenly across sources with varying sizes, quality, and availability, such as in the case of federated learning, I will introduce the FedAU algorithm. This algorithm dynamically adjusts aggregation weights in the model training process based on the availability of data sources, to prevent model bias and improve training convergence. Afterwards, I will introduce techniques to make both training and inference more efficient, focusing on a framework that optimizes model selection from a zoo of LLMs to minimize energy usage while maintaining model performance guarantees. Together, these approaches form a blueprint for future AI systems that are capable of learning effectively from a vast amount of data at diverse sources and delivering high quality models while enhancing resource efficiency in real-world applications.
Short Bio
Shiqiang Wang is a Staff Research Scientist at IBM T. J. Watson Research Center, NY, USA. He received his Ph.D. from Imperial College London, United Kingdom, in 2015. His research focuses on the intersection of distributed computing, machine learning, networking, and optimization, currently emphasizing on quality and efficiency aspects related to distributed data and models, which has a broad range of applications including distributed data analytics, efficient model training and inference, edge-based artificial intelligence (Edge AI), and large language models (LLMs). He received the IEEE Communications Society (ComSoc) Leonard G. Abraham Prize in 2021, IEEE ComSoc Best Young Professional Award in Industry in 2021, IBM Outstanding Technical Achievement Awards (OTAA) in 2019, 2021, 2022, and 2023, multiple Invention Achievement Awards from IBM since 2016. For more details, please visit his homepage at https://shiqiang.wang/