Interview with the team behind Microsoft’s µTransfer

Recently, Microsoft researchers Edward Hu, Greg Yang, and Jianfeng Gao introduced µ-Parametrization, which offers maximal feature learning even in the infinite-width limit.
With machine learning models growing ever larger, training them is becoming a challenging task. Tuning in particular becomes highly cumbersome and resource-intensive when parameters run into the billions (and now even trillions). Recently, researchers from Microsoft (Edward Hu, PhD student; Greg Yang, senior researcher; and Jianfeng Gao, distinguished scientist and vice president) introduced µ-Parametrization, which offers maximal feature learning even in the infinite-width limit. The researchers further collaborated with OpenAI to demonstrate its practical advantages, as recorded in this paper. We caught up with Edward Hu and Greg Yang to learn more about their research. Edited excerpts: AIM: How did you recognise this as a prob
Shraddha Goled
I am a technology journalist with AIM. I write stories focused on the AI landscape in India and around the world, with a special interest in analysing its long-term impact on individuals and societies. Reach out to me at shraddha.goled@analyticsindiamag.com.