Top 5 Feature Stores from leading tech firms

Just a few days back, LinkedIn open sourced Feathr.

Advertisement

Uber introduced Michelangelo Palette, the first Feature Store in 2017. Feature Store enables the discovery, documentation, and reuse of features. It is a feature computation and storage service that enables features to be registered, discovered, and used for ML pipelines and by online applications for model inferencing. Feature Stores stockpile swathes of feature data and offer low latency access to features for online applications. It also ensures consistent feature computations across the batch and serving APIs. We have compiled a list of the top Feature Stores of leading companies.

Amazon SageMaker Feature Store 

SageMaker Feature Store is a fully managed, purpose-built repository to store, update, retrieve, and share machine learning features.It can ingest data from multiple sources and tags and indexes features to helps users easily discover them through a visual interface in SageMaker Studio. 

THE BELAMY

Sign up for your weekly dose of what's up in emerging technology.

Image: Amazon

It also provides a unified store for features during training and real-time inference and doesn’t require any extra code or manual maintainence. It tracks metadata of stored features to help user query the features for the right attributes in batches or in real time using Amazon Athena. The Feature Store easily integrates with Amazon SageMaker Pipelines to create, add feature search and discovery, and reuse automated machine learning workflows. 

Vertex AI Feature Store 

The Vertex AI Feature Store from Google allows data scientists to focus on the feature computation logic rather than on the problems that come with deploying features in the production stage. Essentially, it is a fully managed centralised repository for organising, storing, and serving ML features. Vertex AI Feature Store provides search and filter capabilities for users to easily discover and reuse existing features. It also helps detect significant changes to your feature data distribution over time ( called drift). Additionally, it constantly tracks the distribution of feature values ingested into the Feature Store.

Feathr-LinkedIn

LinkedIn defines Feathr as an abstraction layer that offers a common feature namespace for defining features and a common platform for computing, serving, and accessing them ‘by name’ within ML workflows. 

Feathr enhances machine learning productivity by reducing the need for individual teams to manage feature pipelines with the option of sharing features easily across projects. Feathr also has advanced support for feature transformations, enabling users toexperiment with new features based on raw data sets. Just a few days back, LinkedIn open sourced Feathr.

Hopsworks

The Hopswork Feature Store allows scaling a model to hundred times. It improves the quality and re-usability of features and models to develop solutions for business use cases. The open-source Feature Store works for environments such as Amazon, Google and Azure. 

ML Lakes-Salesforce

ML Lake allows Salesforce application developers and data scientists to easily build machine learning capabilities on customer and non-customer data. It centralises the security controls required for maintaining trust. It assists developers in managing data pipelines, storage, security and compliance.

Image: Salesforce

Eli Levine from Salesforce engineering said ML Lake is deployed in multiple AWS regions as a shared service for use by internal Salesforce teams and in applications running in both public cloud providers and Salesforce’s own data centres.

Overton-Apple

Overton also wants to help developers to concentrate on higher-level tasks instead of lower-level machine learning tasks. Overton supports developers and engineers in building, monitoring, and improving production machine learning systems. Apple adds that Overton  “automates the life cycle of model construction, deployment, and monitoring by providing a set of novel high-level, declarative abstractions.”

More Great AIM Stories

Sreejani Bhattacharyya
I am a technology journalist at AIM. What gets me excited is deep-diving into new-age technologies and analysing how they impact us for the greater good. Reach me at sreejani.bhattacharyya@analyticsindiamag.com

Our Upcoming Events

Conference, in-person (Bangalore)
MachineCon 2022
24th Jun

Conference, Virtual
Deep Learning DevCon 2022
30th Jul

Conference, in-person (Bangalore)
Cypher 2022
21-23rd Sep

3 Ways to Join our Community

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Telegram Channel

Discover special offers, top stories, upcoming events, and more.

Subscribe to our newsletter

Get the latest updates from AIM
MORE FROM AIM