MITB Banner

Microsoft open-sources distributed ML library SynapseML

SynapseML runs on Apache Spark, provides a language-agnostic API abstraction over several datastores, and integrates with several existing ML technologies, including Open Neural Network Exchange (ONNX).

Share

Microsoft released SynapseML, an open-source library for creating and managing distributed ML pipelines, software engineer Mark Hamilton announced in his blog.

SynapseML runs on Apache Spark and takes advantage of Spark’s large-scale fault-tolerant compute clusters management. The library has APIs for Python as well as Java, with the ability to generate bindings for Java, R, and C#. 

In addition, it includes the HTTP on Spark module, allowing users efficient integration of web services into their pipelines and pre-built wrappers for invoking several such services, including Azure Cognitive Services

To perform distributed inference on Spark, using ONNX, developers can deploy pre-trained models from Microsoft’s ONNX Model Hub or convert models built in other frameworks like TensorFlow or PyTorch.The Spark Serving module allows developers to expose their Spark pipelines as low-latency web services.

Hamilton, in his blog, said, “Our goal is to free developers from the hassle of worrying about the distributed implementation details and enable them to deploy them into a variety of databases, clusters, and languages without needing to change their code.”

SynapseML also includes tools for responsible AI, such as data balance analysis and model explainability. The library includes support for AutoML features, such as finding the best-performing model using hyperparameter search and Spark-native implementation of several models, including an anomaly-detection model for cyber security; an isolation forest model, which performs nonlinear outlier detection; and a conditional k-nearest-neighbour model.

Share
Picture of Poornima Nataraj

Poornima Nataraj

Poornima Nataraj has worked in the mainstream media as a journalist for 12 years, she is always eager to learn anything new and evolving. Witnessing a revolution in the world of Analytics, she thinks she is in the right place at the right time.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.