
Jeffrey Dean From Google On How Deep Learning Is Transforming Computer Architecture


Over the past few years, deep learning has seen a string of improvements and computational advances from researchers and academics. These advances span not only fundamental areas like speech recognition, computer vision and language understanding, but also fields such as quantum chemistry, flood forecasting, protein folding and genomics.

Computational performance has now advanced to the point where machine learning models can tackle many real-world problems in real time, widening their role across the computing industry. According to one analysis report, three main factors drive progress in artificial intelligence: algorithmic innovation, data, and the amount of compute available for training.

The machine learning research community has also been growing rapidly over the past decade: more than 100 research papers are now posted to arXiv every day in machine-learning-related subtopic areas. Recently, Jeffrey Dean of Google discussed the deep learning revolution and its implications for computer architecture and chip design.

According to Dean, deep learning has three main properties that matter for hardware:

  • They are very tolerant of reduced-precision computations.
  • The computations performed by most models are simply different compositions of a relatively small handful of operations like matrix multiplies, vector operations, application of convolutional kernels, and other dense linear algebra calculations. 
  • This creates an opportunity to build computational hardware that is specialised for dense, low-precision linear algebra and is programmable at the level of specifying programs as compositions of mostly linear algebra-style operations (a minimal sketch of the first two properties follows this list).
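
To make the first two properties concrete, here is a minimal, hypothetical sketch (not from Dean's talk) in plain NumPy: the same dense matrix multiply run in float32 and in reduced precision gives nearly identical results, which is exactly the property that specialised matrix-multiply hardware exploits.

```python
# Hypothetical illustration: a dense matmul in full and reduced precision.
import numpy as np

rng = np.random.default_rng(0)
activations = rng.standard_normal((64, 256)).astype(np.float32)
weights = rng.standard_normal((256, 128)).astype(np.float32) * 0.05

full_precision = activations @ weights                                   # float32 matmul
reduced = activations.astype(np.float16) @ weights.astype(np.float16)   # float16 matmul

# The relative error stays small for typical neural-network workloads,
# which is why low-precision dense linear algebra units work so well.
rel_error = np.abs(full_precision - reduced.astype(np.float32)).max() / np.abs(full_precision).max()
print(f"max relative error with float16 matmul: {rel_error:.4f}")
```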

The machine learning research field is moving at a very fast pace, and keeping up with it takes deliberate effort. To gauge the scale of the change, computer architects, higher-level software system builders and machine learning researchers have been discussing questions such as which research trends are starting to appear and what they imply for machine learning hardware. Out of these discussions came several areas, outlined below, where deep learning and systems work can accelerate each other.

Deep Learning For Chip Design

Researchers have found significant potential in using machine learning to automatically generate high-quality solutions to a number of NP-hard optimisation problems that arise in the workflow for designing custom ASICs. An automated, ML-based system also enables rapid design space exploration, as the reward function can easily be adjusted to optimise for different trade-offs among the target metrics.
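
As a rough illustration of the "adjustable reward" idea, here is a hypothetical sketch (not Google's actual system) of a reward over made-up placement metrics; re-weighting its terms re-targets the search toward a different trade-off without changing the learning algorithm itself.

```python
# Hypothetical reward function for ML-driven chip placement.
from dataclasses import dataclass

@dataclass
class PlacementMetrics:
    wirelength: float   # estimated total wirelength (lower is better)
    congestion: float   # routing congestion estimate (lower is better)
    area: float         # silicon area used (lower is better)

def placement_reward(m: PlacementMetrics,
                     w_wirelength: float = 1.0,
                     w_congestion: float = 0.5,
                     w_area: float = 0.1) -> float:
    """Negative weighted cost; changing the weights re-targets the search."""
    return -(w_wirelength * m.wirelength
             + w_congestion * m.congestion
             + w_area * m.area)

# The same candidate placement scored under two different trade-offs.
candidate = PlacementMetrics(wirelength=120.0, congestion=3.2, area=45.0)
print(placement_reward(candidate))                    # default trade-off
print(placement_reward(candidate, w_congestion=5.0))  # congestion-sensitive trade-off
```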

Deep Learning for Semiconductor Manufacturing Problems

Though computer vision has made dramatic improvements over the last few years, there are still problems in the visual inspection of wafers during semiconductor manufacturing where accuracy needs to improve over existing approaches.

Deep Learning for Learned Heuristics in Computer Systems

The third area is the use of learned heuristics in computer systems such as compilers, operating systems, file systems and networking stacks. Replacing hand-tuned heuristics with learned ones allows these systems to adapt more readily to the actual usage patterns they encounter, and is another place where deep learning and systems research reinforce each other.
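
As an illustration of what a learned heuristic might look like, here is a hypothetical sketch: a tiny logistic model replacing a fixed "inline if the callee is small enough" threshold in a compiler. The features, weights and threshold here are invented for the example; a real system would fit them from profiles of actual workloads.

```python
# Hypothetical "learned heuristic" replacing a hard-coded inlining rule.
import math

# Hand-initialised weights over two features: callee size and call frequency.
# In a learned system these would be fitted from profiles of real workloads.
W_SIZE, W_FREQ, BIAS = -0.02, 0.8, 0.0

def inline_probability(callee_size: int, calls_per_run: float) -> float:
    """Logistic score replacing a fixed 'inline if size < N' threshold."""
    z = W_SIZE * callee_size + W_FREQ * math.log1p(calls_per_run) + BIAS
    return 1.0 / (1.0 + math.exp(-z))

def should_inline(callee_size: int, calls_per_run: float) -> bool:
    return inline_probability(callee_size, calls_per_run) > 0.5

# A small, hot function is inlined; a large, rarely-called one is not.
print(should_inline(callee_size=30, calls_per_run=10_000))   # True
print(should_inline(callee_size=900, calls_per_run=2))       # False
```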

Wrapping Up

Dean concluded the discussion by mentioning a few interesting threads of research currently active in the ML community:

  • Work on Sparsely-Activated Models: Sparsely-activated models, such as the sparsely-gated mixture-of-experts model, show how to build models with very large capacity where only a fraction of the model is used for any one example. The routing (gating) functions are learned jointly with the experts, so individual experts specialise in particular kinds of examples (see the sketch after this list).
  • Work on Automated Machine Learning (AutoML): Techniques such as neural architecture search (NAS) and evolutionary architecture search run many automated experiments and can automatically discover effective structures and other aspects of machine learning models.
  • Multi-Task Training At Modest Scales: Multi-task training at modest scales, as well as transfer learning from a model trained on a large amount of data, has proved very effective on a range of complex problems.
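
To illustrate the sparsely-gated mixture-of-experts idea from the first bullet, here is a minimal, hypothetical NumPy sketch of the routing step: the gate scores every expert, but only the top-k experts are actually evaluated for a given example, so per-example compute stays modest even as total model capacity grows.

```python
# Hypothetical sparsely-gated mixture-of-experts routing for a single example.
import numpy as np

rng = np.random.default_rng(0)
D, H, NUM_EXPERTS, TOP_K = 32, 64, 8, 2

gate_w = rng.standard_normal((D, NUM_EXPERTS)) * 0.1
experts = [(rng.standard_normal((D, H)) * 0.1, rng.standard_normal((H, D)) * 0.1)
           for _ in range(NUM_EXPERTS)]

def moe_layer(x: np.ndarray) -> np.ndarray:
    """Route one example through only its top-k experts."""
    logits = x @ gate_w
    top = np.argsort(logits)[-TOP_K:]                          # selected expert indices
    weights = np.exp(logits[top]) / np.exp(logits[top]).sum()  # renormalised gate weights
    out = np.zeros_like(x)
    for w, idx in zip(weights, top):
        w_in, w_out = experts[idx]
        out += w * (np.maximum(x @ w_in, 0.0) @ w_out)         # small ReLU expert MLP
    return out

x = rng.standard_normal(D)
print(moe_layer(x).shape)   # (32,): same shape as the input, computed by 2 of 8 experts
```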