Amazon Web Services (AWS): SageMaker: Points to remember (New Updates)

Let's learn about Amazon SageMaker (New Updates):

  1. AWS describes SageMaker Autopilot as the industry’s first automated machine learning capability that gives users complete control and visibility into their ML models.

  2. Autopilot automatically inspects raw data, applies feature processors, picks the best set of algorithms, trains & tunes multiple models.

  3. Users get full visibility into how the model was created and what’s in it. SageMaker Autopilot also integrates with SageMaker Studio.

  4. SageMaker Autopilot can be used by people without machine learning experience to easily produce a model.

  5. SageMaker provides a full end-to-end workflow, but users can continue to use their existing tools with SageMaker.

  6. SageMaker allows users to select the number and type of instance used for the hosted notebook, training & model hosting.

  7. SageMaker stores code in ML storage volumes, secured by security groups and optionally encrypted at rest.

  8. SageMaker Studio provides a single, web-based visual interface where users can perform all ML development steps.

  9. SageMaker Studio gives users complete access, control & visibility into each step required to build, train & deploy models.

  10. SageMaker Autopilot is a generic automatic ML solution for classification and regression problems, such as fraud detection, churn analysis & targeted marketing.

  11. Users can train models using SageMaker Autopilot and get full access to the models as well as the pipelines that generated the models.

  12. SageMaker Autopilot supported 2 built-in algorithms at launch: XGBoost and Linear Learner.

  13. Amazon SageMaker Autopilot built-in algorithms support distributed training out of the box.

  14. SageMaker supports Jupyter notebooks.

  15. SageMaker Notebooks provide one-click Jupyter notebooks that users can start working with in seconds.

  16. With SageMaker Notebooks, users can sign in with their corporate credentials using SSO and start working with notebooks within seconds.

  17. SageMaker Notebooks give users access to all SageMaker features, such as distributed training, batch transform, hosting & experiment management.

  18. SageMaker Ground Truth provides automated data labeling using machine learning.

  19. For automated labeling, SageMaker Ground Truth first selects a random sample of data and sends it to Amazon Mechanical Turk to be labeled by humans. Those labels are then used to train a labeling model.

  20. SageMaker Experiments helps users organize and track iterations of machine learning models.

  21. SageMaker Experiments helps users manage iterations by automatically capturing the input parameters, configurations and results, and storing them as experiments.

  22. SageMaker Debugger makes the training process more transparent by automatically capturing real-time metrics during training, such as training and validation loss, confusion matrices & learning gradients, to help improve model accuracy.

  23. The metrics from SageMaker Debugger can be visualized in SageMaker Studio for easy understanding.

  24. SageMaker Debugger can also generate warnings and remediation advice when common training problems are detected.

  25. SageMaker RL includes RL toolkits such as Coach and Ray RLlib that offer implementations of RL agent algorithms such as DQN, PPO, A3C, and many more.
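
To make the Autopilot points above more concrete, here is a minimal sketch of how a classification or regression job might be described for the CreateAutoMLJob API. It only builds the request dictionary and does not call AWS. The helper name build_autopilot_request, the metric chosen for each problem type, and the bucket, column and role values are assumptions made up for illustration; they are not from the original notes.

```python
from typing import Any, Dict

# Problem types accepted by the CreateAutoMLJob API, each paired with an
# objective metric. The pairing below is an illustrative choice.
DEFAULT_METRIC = {
    "BinaryClassification": "F1",
    "MulticlassClassification": "Accuracy",
    "Regression": "MSE",
}


def build_autopilot_request(
    job_name: str,
    input_s3_uri: str,
    output_s3_uri: str,
    target_column: str,
    role_arn: str,
    problem_type: str = "BinaryClassification",
) -> Dict[str, Any]:
    """Build keyword arguments for an Autopilot job (hypothetical helper)."""
    if problem_type not in DEFAULT_METRIC:
        raise ValueError(f"Unsupported problem type: {problem_type}")
    return {
        "AutoMLJobName": job_name,
        # Raw tabular data in S3; Autopilot inspects and preprocesses it.
        "InputDataConfig": [
            {
                "DataSource": {
                    "S3DataSource": {
                        "S3DataType": "S3Prefix",
                        "S3Uri": input_s3_uri,
                    }
                },
                "TargetAttributeName": target_column,
            }
        ],
        # Where candidate models and generated notebooks are written.
        "OutputDataConfig": {"S3OutputPath": output_s3_uri},
        "ProblemType": problem_type,
        "AutoMLJobObjective": {"MetricName": DEFAULT_METRIC[problem_type]},
        "RoleArn": role_arn,
    }


# Example: a churn-analysis job. All values are placeholders.
request = build_autopilot_request(
    job_name="churn-autopilot-demo",
    input_s3_uri="s3://example-bucket/churn/train/",
    output_s3_uri="s3://example-bucket/churn/output/",
    target_column="churned",
    role_arn="arn:aws:iam::123456789012:role/ExampleSageMakerRole",
)

# With AWS credentials configured, the request could be submitted like this:
#   import boto3
#   boto3.client("sagemaker").create_auto_ml_job(**request)
```

Keeping the request as a plain dictionary makes it easy to review or log before anything is sent to AWS.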

A Points to remember series by Piyush Jalan.