Google AI researchers present a new method to train models, ‘DeepCTRL’

DeepCTRL is now being evaluated on machine learning use cases in physics and healthcare, both of which rely heavily on rules.

Deep neural networks (DNNs) provide increasingly accurate outputs as the volume and variety of their training data increases. While investing in high-quality, large-scale labelled datasets are one strategy to improve models, and another is to apply “rules” – reasoning heuristics, equations, associative logic, or limitations. Consider the classic physics problem of forecasting the future state of a double pendulum system using a model. While the model may learn to predict the system’s total energy at a given point in time only from empirical data, it will often overestimate the energy unless given an equation that incorporates known physical constraints, such as energy conservation. The model can’t represent such well-established physical principles on its own. How could such rules be taught so that DNNs gain the necessary data rather than simply learn from it?

What is DeepCTRL 

Google Cloud AI researchers have offered a unique deep learning training approach that incorporates rules so that the strength of the rules may be controlled at inference. DeepCTRL (Deep Neural Networks with Controllable Rule Representations) combines a rule encoder and a rule-based objective into the model, allowing for a shared representation for decision-making. Data type and model architecture are unimportant to DeepCTRL. 

It can be used with any input/output rule. The key feature of DeepCTRL is that it does not require retraining to alter rule strength – the user may adjust it at inference based on the desired accuracy vs rule verification ratio. 

Subscribe to our Newsletter

Join our editors every weekday evening as they steer you through the most significant news of the day, introduce you to fresh perspectives, and provide unexpected moments of joy
Your newsletter subscriptions are subject to AIM Privacy Policy and Terms and Conditions.

Advantages of DeepCTRL

The advantages of learning through rules are numerous. For starters, rules can provide additional information in circumstances where data oversight is limited, boosting test accuracy. Second, the rules can help DNNs gain more trust and reliability. The fact that DNNs are ‘black-box’ is a key roadblock to their widespread adoption. Users’ trust is often eroded due to a lack of comprehension of the reasons behind their reasoning and discrepancies of their outputs with human judgement. Inconsistencies can be minimised, and users’ confidence can be improved by implementing rules. For example, if a DNN for loan delinquency prediction can absorb all of a bank’s decision heuristics, the bank’s loan officers can have more confidence in the forecasts. 

Third, DNNs are sensitive to a variety of inputs that are incomprehensive to humans. The impact of these modifications can be reduced using rules since the model search space is confined further to reduce underspecification.

Previous research

Various ways for incorporating ‘rules’ into deep learning, taking into account existing knowledge in a wide range of applications, have been investigated. One method for injecting rules into forecasts is posterior regularisation. The teacher network is created by projecting the student network into a (logic) rule-regularised subspace and then updating the student network to balance between replicating the teacher’s output and anticipating the labels. Adversarial learning is used to penalise undesired biases, specifically for bias rules.

What makes DeepCTRL different? 

DeepCTRL proposes a framework for training with rules that take advantage of Lagrangian duality. Learning with constraints is investigated in using a formulation over the space of confusion matrices and optimisation solvers that work in a series of linear reduction steps. For variational autoencoders, KL divergence is used to inject constraints for regulating output diversity or disentangled latent factor representations. DeepCTRL differs from the others in that it injects the rules so that it provides controllability of rule strength at inference without retraining, which is possible by correct learning of rule representations in the data manifold. Beyond simply increasing rule verification for target precision, this opens up new possibilities.

Conclusion 

DeepCTRL offers a number of possible uses in real-world deep learning deployments, including improving accuracy, increasing reliability, and improving human-AI interaction. On the other hand, the researchers have thought it pertinent to highlight that DeepCTRL’s capacity to effectively encode rules can have unintended consequences if it is used with wrong intentions to teach immoral prejudices.

Abhishree Choudhary
Abhishree is a budding tech journalist with a UGD in Political Science. In her free time, Abhishree can be found watching French new wave classic films and playing with dogs.

Download our Mobile App

MachineHack | AI Hackathons, Coding & Learning

Host Hackathons & Recruit Great Data Talent!

AIMResearch Pioneering advanced AI market research

With a decade of experience under our belt, we are transforming how businesses use AI & data-driven insights to succeed.

The Gold Standard for Recognizing Excellence in Data Science and Tech Workplaces

With Best Firm Certification, you can effortlessly delve into the minds of your employees, unveil invaluable perspectives, and gain distinguished acclaim for fostering an exceptional company culture.

AIM Leaders Council

World’s Biggest Community Exclusively For Senior Executives In Data Science And Analytics.

3 Ways to Join our Community

Telegram group

Discover special offers, top stories, upcoming events, and more.

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox
MOST POPULAR