Amazon MLS-C01 Dumps PDF

April, 2025 MLS-C01 Practice Questions

Make success possible with our Latest and Unique AWS Certified Specialty MLS-C01 Practice Exam!


Name: AWS Certified Machine Learning - Specialty
Exam Code: MLS-C01
Certification: AWS Certified Specialty
Vendor: Amazon
Total Questions: 307
Last Updated: April 24, 2025
234 Satisfied Customers

discount banner

$59 $82.6 Add To Cart

Last Week Results

18
Amazon MLS-C01 customers passed exam this week.
98%
Average Score in Real MLS-C01 Exam in Testing Centre.
92%
MLS-C01 Exam Questions came from DumpsGroup Material.



Unique Spoto Amazon MLS-C01 Practice Questions

Success is simply the result of the efforts you put into the preparation. We at Dumpsgroup wish to make that preparation a lot easier. The AWS Certified Machine Learning - Specialty MLS-C01 Practice Exam we offer is solely for best results. Our IT experts put in their blood and sweat into carefully selecting and compiling these unique Practice Questions. So, you can achieve your dreams of becoming a AWS Certified Specialty professional. Now is the time to press that big buy button and take the first step to a better and brighter future.

Passing the Amazon MLS-C01 exam is simpler if you have globally valid resources and Dumpsgroup provides you just that. Millions of customers come to us daily, leaving the platform happy and satisfied. Because we aim to provide you with AWS Certified Specialty Practice Questions aligned with the latest patterns of the AWS Certified Machine Learning - Specialty Exam. And not just that, our reliable customer services are 24 hours at your beck and call to support you in every way necessary. Order now to see the MLS-C01 Exam results you always desired.

2 Surefire Ways to Pass Amazon MLS-C01 Exam!

You must have heard about candidates failing in a large quantity and perhaps tried yourself and fail to pass AWS Certified Machine Learning - Specialty. It is best to try Dumpsgroup’s MLS-C01 Practice Questions this time around. Dumpsgroup not only provides an authentic, valid, and accurate resource for your preparation. They simplified the training by dividing it into two different formats for ease and comfort. Now you can get the Amazon MLS-C01 in both PDF and Online Test Engine formats. Choose whichever or both to start your AWS Certified Specialty certification exam preparation.

Furthermore, Dumpsgroup gives a hefty percentage off on these Spoto MLS-C01 Practice Exam by applying a simple discount code; when the actual price is already so cheap. The updates for the first three months, from the date of your purchase, are FREE. Our esteemed customers cannot stop singing praises of our Amazon MLS-C01 Practice Questions. That is because we offer only the questions with the highest possibility of appearing in the actual exam. Download the free demo and see for yourself.

The MLS-C01 Practice Exam for Achievers

We know you have been struggling to compete with your colleagues in your workplace. That is why we provide the MLS-C01 Practice Questions to let you gain the upper hand that you always wanted. These questions and answers are a thorough guide in a simple and exam-like format! That makes understanding and excelling in your field way lot easier. Our aim is not just to help to pass the AWS Certified Specialty Exam but to make a Amazon professional out of you. For that purpose, our MLS-C01 Practice Exams are the best choice.

Why You Choose Us:

  1. We can give you a million reasons to choose us for your AWS Certified Machine Learning - Specialty preparation. But we narrow down to the basics:
  2. Our Free MLS-C01 Practice Questions in the demo version are easily downloadable. A surefire way to ensure you are entrusting your training to a reliable resource is looking at it yourself.
  3. Online Test Engine & PDF: we give you two different methods to prepare your AWS Certified Specialty exam; MLS-C01 Practice Exam PDF and an online Test Engine version. Now you can advance your skills in the real-like exam practice environment. Choose the method that suits you best and prepare yourself for success.
  4. Safe & Secure Transaction: you can take it easy while buying your MLS-C01 Practice Questions. Dumpsgroup uses the latest and secure payment method to preserve our customer privacy and money. Our staff personnel have aligned capable security systems with high-end security technology. You know your details are safe with us because we never save them to avoid any inconvenience later.
  5. 24-hour customer support: you no longer have to worry about getting into trouble because our reliable customer care staff are active 24 hours to provide you support whenever you want.

MLS-C01 Practice Exam to Pass!

There are many resources available online for the preparation of the AWS Certified Machine Learning - Specialty Exam. But that does mean that all of them are reliable. When your future as a AWS Certified Specialty certified is at risk, you have got to think twice while choosing Amazon MLS-C01 Practice Questions. Dumpsgroup is not only a verified source of training material but has been in this business for years. In those years, we researched on MLS-C01 Practice Exam and came up with the best solution. So, you can trust that we know what we are doing. Moreover, we have joined hands with Amazon experts and professionals who are exceptional in their skills. And these experts approved our MLS-C01 Practice Questions for AWS Certified Machine Learning - Specialty preparation.

Sample Questions


MLS-C01 Sample Question 1


A company wants to enhance audits for its machine learning (ML) systems. The auditing
system must be able to perform metadata analysis on the features that the ML models use.
The audit solution must generate a report that analyzes the metadata. The solution also
must be able to set the data sensitivity and authorship of features.
Which solution will meet these requirements with the LEAST development effort?

A. Use Amazon SageMaker Feature Store to select the features. Create a data flow toperform feature-level metadata analysis. Create an Amazon DynamoDB table to storefeature-level metadata. Use Amazon QuickSight to analyze the metadata.
B. Use Amazon SageMaker Feature Store to set feature groups for the current featuresthat the ML models use. Assign the required metadata for each feature. Use SageMakerStudio to analyze the metadata.
C. Use Amazon SageMaker Features Store to apply custom algorithms to analyze thefeature-level metadata that the company requires. Create an Amazon DynamoDB table tostore feature-level metadata. Use Amazon QuickSight to analyze the metadata.
D. Use Amazon SageMaker Feature Store to set feature groups for the current featuresthat the ML models use. Assign the required metadata for each feature. Use AmazonQuickSight to analyze the metadata.


ANSWER : D



MLS-C01 Sample Question 2


A retail company is ingesting purchasing records from its network of 20,000 stores to
Amazon S3 by using Amazon Kinesis Data Firehose. The company uses a small, serverbased
application in each store to send the data to AWS over the internet. The company
uses this data to train a machine learning model that is retrained each day. The company's
data science team has identified existing attributes on these records that could be
combined to create an improved model.
Which change will create the required transformed records with the LEAST operational
overhead?

A. Create an AWS Lambda function that can transform the incoming records. Enable datatransformation on the ingestion Kinesis Data Firehose delivery stream. Use the Lambdafunction as the invocation target.
B. Deploy an Amazon EMR cluster that runs Apache Spark and includes the transformationlogic. Use Amazon EventBridge (Amazon CloudWatch Events) to schedule an AWS Lambda function to launch the cluster each day and transform the records that accumulatein Amazon S3. Deliver the transformed records to Amazon S3.
C. Deploy an Amazon S3 File Gateway in the stores. Update the in-store software todeliver data to the S3 File Gateway. Use a scheduled daily AWS Glue job to transform thedata that the S3 File Gateway delivers to Amazon S3.
D. Launch a fleet of Amazon EC2 instances that include the transformation logic. Configurethe EC2 instances with a daily cron job to transform the records that accumulate in AmazonS3. Deliver the transformed records to Amazon S3.


ANSWER : A



MLS-C01 Sample Question 3


A Machine Learning Specialist is training a model to identify the make and model of
vehicles in images The Specialist wants to use transfer learning and an existing model
trained on images of general objects The Specialist collated a large custom dataset of
pictures containing different vehicle makes and models.
What should the Specialist do to initialize the model to re-train it with the custom data?

A. Initialize the model with random weights in all layers including the last fully connectedlayer
B. Initialize the model with pre-trained weights in all layers and replace the last fullyconnected layer.
C. Initialize the model with random weights in all layers and replace the last fully connectedlayer
D. Initialize the model with pre-trained weights in all layers including the last fully connectedlayer


ANSWER : B



MLS-C01 Sample Question 4


A data science team is working with a tabular dataset that the team stores in Amazon S3.
The team wants to experiment with different feature transformations such as categorical
feature encoding. Then the team wants to visualize the resulting distribution of the dataset.
After the team finds an appropriate set of feature transformations, the team wants to
automate the workflow for feature transformations.
Which solution will meet these requirements with the MOST operational efficiency?

A. Use Amazon SageMaker Data Wrangler preconfigured transformations to explorefeature transformations. Use SageMaker Data Wrangler templates for visualization. Exportthe feature processing workflow to a SageMaker pipeline for automation.
B. Use an Amazon SageMaker notebook instance to experiment with different featuretransformations. Save the transformations to Amazon S3. Use Amazon QuickSight forvisualization. Package the feature processing steps into an AWS Lambda function forautomation.
C. Use AWS Glue Studio with custom code to experiment with different featuretransformations. Save the transformations to Amazon S3. Use Amazon QuickSight forvisualization. Package the feature processing steps into an AWS Lambda function forautomation.
D. Use Amazon SageMaker Data Wrangler preconfigured transformations to experimentwith different feature transformations. Save the transformations to Amazon S3. UseAmazon QuickSight for visualzation. Package each feature transformation step into aseparate AWS Lambda function. Use AWS Step Functions for workflow automation.


ANSWER : A



MLS-C01 Sample Question 5


A company’s data scientist has trained a new machine learning model that performs better
on test data than the company’s existing model performs in the production environment.
The data scientist wants to replace the existing model that runs on an Amazon SageMaker
endpoint in the production environment. However, the company is concerned that the new
model might not work well on the production environment data.
The data scientist needs to perform A/B testing in the production environment to evaluate
whether the new model performs well on production environment data.
Which combination of steps must the data scientist take to perform the A/B testing?
(Choose two.)

A. Create a new endpoint configuration that includes a production variant for each of thetwo models.
B. Create a new endpoint configuration that includes two target variants that point todifferent endpoints.
C. Deploy the new model to the existing endpoint.
D. Update the existing endpoint to activate the new model.
E. Update the existing endpoint to use the new endpoint configuration.


ANSWER : A,E



MLS-C01 Sample Question 6


A wildlife research company has a set of images of lions and cheetahs. The company
created a dataset of the images. The company labeled each image with a binary label that
indicates whether an image contains a lion or cheetah. The company wants to train a
model to identify whether new images contain a lion or cheetah.
.... Dh Amazon SageMaker algorithm will meet this requirement?

A. XGBoost
B. Image Classification - TensorFlow
C. Object Detection - TensorFlow
D. Semantic segmentation - MXNet


ANSWER : B



MLS-C01 Sample Question 7


The chief editor for a product catalog wants the research and development team to build a
machine learning system that can be used to detect whether or not individuals in a
collection of images are wearing the company's retail brand. The team has a set of training
data.
Which machine learning algorithm should the researchers use that BEST meets their
requirements?

A. Latent Dirichlet Allocation (LDA)
B. Recurrent neural network (RNN)
C. K-means
D. Convolutional neural network (CNN)


ANSWER : D



MLS-C01 Sample Question 8


A data scientist wants to use Amazon Forecast to build a forecasting model for inventory
demand for a retail company. The company has provided a dataset of historic inventory
demand for its products as a .csv file stored in an Amazon S3 bucket. The table below
shows a sample of the dataset.


How should the data scientist transform the data?

A. Use ETL jobs in AWS Glue to separate the dataset into a target time series dataset andan item metadata dataset. Upload both datasets as .csv files to Amazon S3.
B. Use a Jupyter notebook in Amazon SageMaker to separate the dataset into a relatedtime series dataset and an item metadata dataset. Upload both datasets as tables inAmazon Aurora.
C. Use AWS Batch jobs to separate the dataset into a target time series dataset, a relatedtime series dataset, and an item metadata dataset. Upload them directly to Forecast from alocal machine.
D. Use a Jupyter notebook in Amazon SageMaker to transform the data into the optimizedprotobuf recordIO format. Upload the dataset in this format to Amazon S3.


ANSWER : A



MLS-C01 Sample Question 9


A company uses sensors on devices such as motor engines and factory machines to
measure parameters, temperature and pressure. The company wants to use the sensor
data to predict equipment malfunctions and reduce services outages.
The Machine learning (ML) specialist needs to gather the sensors data to train a model to
predict device malfunctions The ML spoctafst must ensure that the data does not contain
outliers before training the ..el.
What can the ML specialist meet these requirements with the LEAST operational
overhead?

A. Load the data into an Amazon SagcMaker Studio notebook. Calculate the first and thirdquartile Use a SageMaker Data Wrangler data (low to remove only values that are outside of those quartiles.
B. Use an Amazon SageMaker Data Wrangler bias report to find outliers in the dataset Usea Data Wrangler data flow to remove outliers based on the bias report.
C. Use an Amazon SageMaker Data Wrangler anomaly detection visualization to findoutliers in the dataset. Add a transformation to a Data Wrangler data flow to removeoutliers.
D. Use Amazon Lookout for Equipment to find and remove outliers from the dataset.


ANSWER : C



MLS-C01 Sample Question 10


A machine learning (ML) specialist uploads 5 TB of data to an Amazon SageMaker Studio
environment. The ML specialist performs initial data cleansing. Before the ML specialist
begins to train a model, the ML specialist needs to create and view an analysis report that
details potential bias in the uploaded data.
Which combination of actions will meet these requirements with the LEAST operational
overhead? (Choose two.)

A. Use SageMaker Clarify to automatically detect data bias
B. Turn on the bias detection option in SageMaker Ground Truth to automatically analyzedata features.
C. Use SageMaker Model Monitor to generate a bias drift report.
D. Configure SageMaker Data Wrangler to generate a bias report.
E. Use SageMaker Experiments to perform a data check


ANSWER : A,D



MLS-C01 Sample Question 11


A chemical company has developed several machine learning (ML) solutions to identify
chemical process abnormalities. The time series values of independent variables and the
labels are available for the past 2 years and are sufficient to accurately model the problem.
The regular operation label is marked as 0. The abnormal operation label is marked as 1 .
Process abnormalities have a significant negative effect on the companys profits. The
company must avoid these abnormalities.
Which metrics will indicate an ML solution that will provide the GREATEST probability of
detecting an abnormality?

A. Precision = 0.91Recall = 0.6
B. Precision = 0.61Recall = 0.98
C. Precision = 0.7Recall = 0.9
D. Precision = 0.98Recall = 0.8


ANSWER : B



MLS-C01 Sample Question 12


A finance company needs to forecast the price of a commodity. The company has compiled
a dataset of historical daily prices. A data scientist must train various forecasting models on
80% of the dataset and must validate the efficacy of those models on the remaining 20% of
the dataset.
What should the data scientist split the dataset into a training dataset and a validation
dataset to compare model performance?

A. Pick a date so that 80% to the data points precede the date Assign that group of datapoints as the training dataset. Assign all the remaining data points to the validation dataset.
B. Pick a date so that 80% of the data points occur after the date. Assign that group of datapoints as the training dataset. Assign all the remaining data points to the validation dataset.
C. Starting from the earliest date in the dataset. pick eight data points for the trainingdataset and two data points for the validation dataset. Repeat this stratified sampling untilno data points remain.
D. Sample data points randomly without replacement so that 80% of the data points are inthe training dataset. Assign all the remaining data points to the validation dataset.


ANSWER : A



MLS-C01 Sample Question 13


A company wants to conduct targeted marketing to sell solar panels to homeowners. The
company wants to use machine learning (ML) technologies to identify which houses
already have solar panels. The company has collected 8,000 satellite images as training data and will use Amazon SageMaker Ground Truth to label the data.
The company has a small internal team that is working on the project. The internal team
has no ML expertise and no ML experience.
Which solution will meet these requirements with the LEAST amount of effort from the
internal team?

A. Set up a private workforce that consists of the internal team. Use the private workforceand the SageMaker Ground Truth active learning feature to label the data. Use AmazonRekognition Custom Labels for model training and hosting.
B. Set up a private workforce that consists of the internal team. Use the private workforceto label the data. Use Amazon Rekognition Custom Labels for model training and hosting.
C. Set up a private workforce that consists of the internal team. Use the private workforceand the SageMaker Ground Truth active learning feature to label the data. Use theSageMaker Object Detection algorithm to train a model. Use SageMaker batch transformfor inference.
D. Set up a public workforce. Use the public workforce to label the data. Use theSageMaker Object Detection algorithm to train a model. Use SageMaker batch transformfor inference.


ANSWER : A



MLS-C01 Sample Question 14


Each morning, a data scientist at a rental car company creates insights about the previous
day’s rental car reservation demands. The company needs to automate this process by
streaming the data to Amazon S3 in near real time. The solution must detect high-demand
rental cars at each of the company’s locations. The solution also must create a
visualization dashboard that automatically refreshes with the most recent data.
Which solution will meet these requirements with the LEAST development time?

A. Use Amazon Kinesis Data Firehose to stream the reservation data directly to AmazonS3. Detect high-demand outliers by using Amazon QuickSight ML Insights. Visualize the data in QuickSight.
B. Use Amazon Kinesis Data Streams to stream the reservation data directly to AmazonS3. Detect high-demand outliers by using the Random Cut Forest (RCF) trained model inAmazon SageMaker. Visualize the data in Amazon QuickSight.
C. Use Amazon Kinesis Data Firehose to stream the reservation data directly to AmazonS3. Detect high-demand outliers by using the Random Cut Forest (RCF) trained model inAmazon SageMaker. Visualize the data in Amazon QuickSight.
D. Use Amazon Kinesis Data Streams to stream the reservation data directly to AmazonS3. Detect high-demand outliers by using Amazon QuickSight ML Insights. Visualize thedata in QuickSight.


ANSWER : A



MLS-C01 Sample Question 15


A beauty supply store wants to understand some characteristics of visitors to the store. The
store has security video recordings from the past several years. The store wants to
generate a report of hourly visitors from the recordings. The report should group visitors by
hair style and hair color.
Which solution will meet these requirements with the LEAST amount of effort?

A. Use an object detection algorithm to identify a visitor’s hair in video frames. Pass theidentified hair to an ResNet-50 algorithm to determine hair style and hair color.
B. Use an object detection algorithm to identify a visitor’s hair in video frames. Pass theidentified hair to an XGBoost algorithm to determine hair style and hair color.
C. Use a semantic segmentation algorithm to identify a visitor’s hair in video frames. Passthe identified hair to an ResNet-50 algorithm to determine hair style and hair color.
D. Use a semantic segmentation algorithm to identify a visitor’s hair in video frames. Passthe identified hair to an XGBoost algorithm to determine hair style and hair.


ANSWER : C



MLS-C01 Sample Question 16


A manufacturing company needs to identify returned smartphones that have been
damaged by moisture. The company has an automated process that produces 2.000
diagnostic values for each phone. The database contains more than five million phone
evaluations. The evaluation process is consistent, and there are no missing values in the
data. A machine learning (ML) specialist has trained an Amazon SageMaker linear learner
ML model to classify phones as moisture damaged or not moisture damaged by using all
available features. The model's F1 score is 0.6.
What changes in model training would MOST likely improve the model's F1 score? (Select
TWO.)

A. Continue to use the SageMaker linear learner algorithm. Reduce the number of featureswith the SageMaker principal component analysis (PCA) algorithm.
B. Continue to use the SageMaker linear learner algorithm. Reduce the number of featureswith the scikit-iearn multi-dimensional scaling (MDS) algorithm.
C. Continue to use the SageMaker linear learner algorithm. Set the predictor type toregressor.
D. Use the SageMaker k-means algorithm with k of less than 1.000 to train the model
E. Use the SageMaker k-nearest neighbors (k-NN) algorithm. Set a dimension reductiontarget of less than 1,000 to train the model.


ANSWER : A,E



MLS-C01 Sample Question 17


A large company has developed a B1 application that generates reports and dashboards
using data collected from various operational metrics The company wants to provide
executives with an enhanced experience so they can use natural language to get data from
the reports The company wants the executives to be able ask questions using written and
spoken interlaces
Which combination of services can be used to build this conversational interface? (Select
THREE)

A. Alexa for Business
B. Amazon Connect
C. Amazon Lex
D. Amazon Poly
E. Amazon Comprehend
F. Amazon Transcribe


ANSWER : C,E,F



MLS-C01 Sample Question 18


A data scientist has been running an Amazon SageMaker notebook instance for a few
weeks. During this time, a new version of Jupyter Notebook was released along with
additional software updates. The security team mandates that all running SageMaker
notebook instances use the latest security and software updates provided by SageMaker.
How can the data scientist meet these requirements?

A. Call the CreateNotebookInstanceLifecycleConfig API operation
B. Create a new SageMaker notebook instance and mount the Amazon Elastic Block Store(Amazon EBS) volume from the original instance
C. Stop and then restart the SageMaker notebook instance
D. Call the UpdateNotebookInstanceLifecycleConfig API operation


ANSWER : C



MLS-C01 Sample Question 19


A manufacturing company has structured and unstructured data stored in an Amazon S3
bucket. A Machine Learning Specialist wants to use SQL to run queries on this data.
Which solution requires the LEAST effort to be able to query this data?

A. Use AWS Data Pipeline to transform the data and Amazon RDS to run queries.
B. Use AWS Glue to catalogue the data and Amazon Athena to run queries.
C. Use AWS Batch to run ETL on the data and Amazon Aurora to run the queries.
D. Use AWS Lambda to transform the data and Amazon Kinesis Data Analytics to runqueries.


ANSWER : B



MLS-C01 Sample Question 20


A financial services company wants to automate its loan approval process by building a
machine learning (ML) model. Each loan data point contains credit history from a thirdparty
data source and demographic information about the customer. Each loan approval
prediction must come with a report that contains an explanation for why the customer was
approved for a loan or was denied for a loan. The company will use Amazon SageMaker to
build the model.
Which solution will meet these requirements with the LEAST development effort?

A. Use SageMaker Model Debugger to automatically debug the predictions, generate theexplanation, and attach the explanation report.
B. Use AWS Lambda to provide feature importance and partial dependence plots. Use theplots to generate and attach the explanation report.
C. Use SageMaker Clarify to generate the explanation report. Attach the report to thepredicted results.
D. Use custom Amazon Cloud Watch metrics to generate the explanation report. Attach thereport to the predicted results.


ANSWER : C



MLS-C01 Sample Question 21


A data scientist uses Amazon SageMaker Data Wrangler to define and perform
transformations and feature engineering on historical data. The data scientist saves the
transformations to SageMaker Feature Store.
The historical data is periodically uploaded to an Amazon S3 bucket. The data scientist
needs to transform the new historic data and add it to the online feature store The data
scientist needs to prepare the .....historic data for training and inference by using native
integrations.
Which solution will meet these requirements with the LEAST development effort?

A. Use AWS Lambda to run a predefined SageMaker pipeline to perform thetransformations on each new dataset that arrives in the S3 bucket.
B. Run an AWS Step Functions step and a predefined SageMaker pipeline to perform thetransformations on each new dalaset that arrives in the S3 bucket
C. Use Apache Airflow to orchestrate a set of predefined transformations on each newdataset that arrives in the S3 bucket.
D. Configure Amazon EventBridge to run a predefined SageMaker pipeline to perform thetransformations when a new data is detected in the S3 bucket.


ANSWER : D



MLS-C01 Sample Question 22


A data scientist at a financial services company used Amazon SageMaker to train and
deploy a model that predicts loan defaults. The model analyzes new loan applications and
predicts the risk of loan default. To train the model, the data scientist manually extracted
loan data from a database. The data scientist performed the model training and
deployment steps in a Jupyter notebook that is hosted on SageMaker Studio notebooks.
The model's prediction accuracy is decreasing over time. Which combination of slept in the
MOST operationally efficient way for the data scientist to maintain the model's accuracy?
(Select TWO.)

A. Use SageMaker Pipelines to create an automated workflow that extracts fresh data,trains the model, and deploys a new version of the model.
B. Configure SageMaker Model Monitor with an accuracy threshold to check for model drift.Initiate an Amazon CloudWatch alarm when the threshold is exceeded. Connect theworkflow in SageMaker Pipelines with the CloudWatch alarm to automatically initiateretraining.
C. Store the model predictions in Amazon S3 Create a daily SageMaker Processing jobthat reads the predictions from Amazon S3, checks for changes in model predictionaccuracy, and sends an email notification if a significant change is detected.
D. Rerun the steps in the Jupyter notebook that is hosted on SageMaker Studio notebooksto retrain the model and redeploy a new version of the model.
E. Export the training and deployment code from the SageMaker Studio notebooks into aPython script. Package the script into an Amazon Elastic Container Service (Amazon ECS)task that an AWS Lambda function can initiate.


ANSWER : A,B



MLS-C01 Sample Question 23


A manufacturing company has a production line with sensors that collect hundreds of
quality metrics. The company has stored sensor data and manual inspection results in a
data lake for several months. To automate quality control, the machine learning team must
build an automated mechanism that determines whether the produced goods are good
quality, replacement market quality, or scrap quality based on the manual inspection
results.
Which modeling approach will deliver the MOST accurate prediction of product quality?

A. Amazon SageMaker DeepAR forecasting algorithm
B. Amazon SageMaker XGBoost algorithm
C. Amazon SageMaker Latent Dirichlet Allocation (LDA) algorithm
D. A convolutional neural network (CNN) and ResNet


ANSWER : D



MLS-C01 Sample Question 24


A machine learning (ML) specialist is using the Amazon SageMaker DeepAR forecasting
algorithm to train a model on CPU-based Amazon EC2 On-Demand instances. The model
currently takes multiple hours to train. The ML specialist wants to decrease the training
time of the model.
Which approaches will meet this requirement7 (SELECT TWO )

A. Replace On-Demand Instances with Spot Instances
B. Configure model auto scaling dynamically to adjust the number of instancesautomatically.
C. Replace CPU-based EC2 instances with GPU-based EC2 instances.
D. Use multiple training instances.
E. Use a pre-trained version of the model. Run incremental training.


ANSWER : C,D



MLS-C01 Sample Question 25


A company is creating an application to identify, count, and classify animal images that are
uploaded to the company’s website. The company is using the Amazon SageMaker image
classification algorithm with an ImageNetV2 convolutional neural network (CNN). The
solution works well for most animal images but does not recognize many animal species
that are less common.
The company obtains 10,000 labeled images of less common animal species and stores
the images in Amazon S3. A machine learning (ML) engineer needs to incorporate the
images into the model by using Pipe mode in SageMaker.
Which combination of steps should the ML engineer take to train the model? (Choose two.)

A. Use a ResNet model. Initiate full training mode by initializing the network with randomweights.
B. Use an Inception model that is available with the SageMaker image classificationalgorithm.
C. Create a .lst file that contains a list of image files and corresponding class labels. Uploadthe .lst file to Amazon S3.
D. Initiate transfer learning. Train the model by using the images of less common species.
E. Use an augmented manifest file in JSON Lines format.


ANSWER : C,D



MLS-C01 Sample Question 26


A company operates large cranes at a busy port. The company plans to use machine
learning (ML) for predictive maintenance of the cranes to avoid unexpected breakdowns
and to improve productivity.
The company already uses sensor data from each crane to monitor the health of the
cranes in real time. The sensor data includes rotation speed, tension, energy consumption,
vibration, pressure, and …perature for each crane. The company contracts AWS ML
experts to implement an ML solution.
Which potential findings would indicate that an ML-based solution is suitable for this
scenario? (Select TWO.)

A. The historical sensor data does not include a significant number of data points andattributes for certain time periods.
B. The historical sensor data shows that simple rule-based thresholds can predict cranefailures.
C. The historical sensor data contains failure data for only one type of crane model that isin operation and lacks failure data of most other types of crane that are in operation.
D. The historical sensor data from the cranes are available with high granularity for the last3 years.
E. The historical sensor data contains most common types of crane failures that thecompany wants to predict.


ANSWER : D,E



MLS-C01 Sample Question 27


A company wants to forecast the daily price of newly launched products based on 3 years
of data for older product prices, sales, and rebates. The time-series data has irregular
timestamps and is missing some values.
Data scientist must build a dataset to replace the missing values. The data scientist needs
a solution that resamptes the data daily and exports the data for further modeling.
Which solution will meet these requirements with the LEAST implementation effort?

A. Use Amazon EMR Serveriess with PySpark.
B. Use AWS Glue DataBrew.
C. Use Amazon SageMaker Studio Data Wrangler.
D. Use Amazon SageMaker Studio Notebook with Pandas.


ANSWER : C



MLS-C01 Sample Question 28


A company wants to predict stock market price trends. The company stores stock market
data each business day in Amazon S3 in Apache Parquet format. The company stores 20
GB of data each day for each stock code.
A data engineer must use Apache Spark to perform batch preprocessing data
transformations quickly so the company can complete prediction jobs before the stock
market opens the next day. The company plans to track more stock market codes and
needs a way to scale the preprocessing data transformations.
Which AWS service or feature will meet these requirements with the LEAST development
effort over time?

A. AWS Glue jobs
B. Amazon EMR cluster
C. Amazon Athena
D. AWS Lambda


ANSWER : A



MLS-C01 Sample Question 29


An ecommerce company wants to use machine learning (ML) to monitor fraudulent
transactions on its website. The company is using Amazon SageMaker to research, train,
deploy, and monitor the ML models.
The historical transactions data is in a .csv file that is stored in Amazon S3 The data
contains features such as the user's IP address, navigation time, average time on each
page, and the number of clicks for ....session. There is no label in the data to indicate if a
transaction is anomalous.
Which models should the company use in combination to detect anomalous transactions?
(Select TWO.)

A. IP Insights
B. K-nearest neighbors (k-NN)
C. Linear learner with a logistic function
D. Random Cut Forest (RCF)
E. XGBoost


ANSWER : D,E



MLS-C01 Sample Question 30


A data scientist is working on a public sector project for an urban traffic system. While
studying the traffic patterns, it is clear to the data scientist that the traffic behavior at each
light is correlated, subject to a small stochastic error term. The data scientist must model
the traffic behavior to analyze the traffic patterns and reduce congestion.
How will the data scientist MOST effectively model the problem?

A. The data scientist should obtain a correlated equilibrium policy by formulating thisproblem as a multi-agent reinforcement learning problem.
B. The data scientist should obtain the optimal equilibrium policy by formulating thisproblem as a single-agent reinforcement learning problem.
C. Rather than finding an equilibrium policy, the data scientist should obtain accuratepredictors of traffic flow by using historical data through a supervised learning approach.
D. Rather than finding an equilibrium policy, the data scientist should obtain accuratepredictors of traffic flow by using unlabeled simulated data representing the new trafficpatterns in the city and applying an unsupervised learning approach.


ANSWER : A



MLS-C01 Sample Question 31


A company wants to create an artificial intelligence (Al) yoga instructor that can lead large
classes of students. The company needs to create a feature that can accurately count the
number of students who are in a class. The company also needs a feature that can
differentiate students who are performing a yoga stretch correctly from students who are
performing a stretch incorrectly.
...etermine whether students are performing a stretch correctly, the solution needs to
measure the location and angle of each student's arms and legs A data scientist must use
Amazon SageMaker to ...ss video footage of a yoga class by extracting image frames and
applying computer vision models.
Which combination of models will meet these requirements with the LEAST effort? (Select
TWO.)

A. Image Classification
B. Optical Character Recognition (OCR)
C. Object Detection
D. Pose estimation
E. Image Generative Adversarial Networks (GANs)


ANSWER : C,D



MLS-C01 Sample Question 32


A company wants to predict the classification of documents that are created from an
application. New documents are saved to an Amazon S3 bucket every 3 seconds. The
company has developed three versions of a machine learning (ML) model within Amazon
SageMaker to classify document text. The company wants to deploy these three versions to predict the classification of each document.
Which approach will meet these requirements with the LEAST operational overhead?

A. Configure an S3 event notification that invokes an AWS Lambda function when newdocuments are created. Configure the Lambda function to create three SageMaker batchtransform jobs, one batch transform job for each model for each document.
B. Deploy all the models to a single SageMaker endpoint. Treat each model as aproduction variant. Configure an S3 event notification that invokes an AWS Lambdafunction when new documents are created. Configure the Lambda function to call eachproduction variant and return the results of each model.
C. Deploy each model to its own SageMaker endpoint Configure an S3 event notificationthat invokes an AWS Lambda function when new documents are created. Configure theLambda function to call each endpoint and return the results of each model.
D. Deploy each model to its own SageMaker endpoint. Create three AWS Lambdafunctions. Configure each Lambda function to call a different endpoint and return theresults. Configure three S3 event notifications to invoke the Lambda functions when newdocuments are created.


ANSWER : B



MLS-C01 Sample Question 33


A company is using Amazon Polly to translate plaintext documents to speech for
automated company announcements However company acronyms are being
mispronounced in the current documents How should a Machine Learning Specialist
address this issue for future documents?

A. Convert current documents to SSML with pronunciation tags
B. Create an appropriate pronunciation lexicon.
C. Output speech marks to guide in pronunciation
D. Use Amazon Lex to preprocess the text files for pronunciation


ANSWER : B



MLS-C01 Sample Question 34


An online delivery company wants to choose the fastest courier for each delivery at the
moment an order is placed. The company wants to implement this feature for existing users
and new users of its application. Data scientists have trained separate models with
XGBoost for this purpose, and the models are stored in Amazon S3. There is one model fof
each city where the company operates.
The engineers are hosting these models in Amazon EC2 for responding to the web client
requests, with one instance for each model, but the instances have only a 5% utilization in
CPU and memory, ....operation engineers want to avoid managing unnecessary resources.
Which solution will enable the company to achieve its goal with the LEAST operational
overhead?

A. Create an Amazon SageMaker notebook instance for pulling all the models fromAmazon S3 using the boto3 library. Remove the existing instances and use the notebook toperform a SageMaker batch transform for performing inferences offline for all the possibleusers in all the cities. Store the results in different files in Amazon S3. Point the web clientto the files.
B. Prepare an Amazon SageMaker Docker container based on the open-source multimodelserver. Remove the existing instances and create a multi-model endpoint inSageMaker instead, pointing to the S3 bucket containing all the models Invoke theendpoint from the web client at runtime, specifying the TargetModel parameter according tothe city of each request.
C. Keep only a single EC2 instance for hosting all the models. Install a model server in theinstance and load each model by pulling it from Amazon S3. Integrate the instance with theweb client using Amazon API Gateway for responding to the requests in real time,specifying the target resource according to the city of each request.
D. Prepare a Docker container based on the prebuilt images in Amazon SageMaker.Replace the existing instances with separate SageMaker endpoints. one for each citywhere the company operates. Invoke the endpoints from the web client, specifying the URL and EndpomtName parameter according to the city of each request.


ANSWER : B



MLS-C01 Sample Question 35


A data engineer is preparing a dataset that a retail company will use to predict the number
of visitors to stores. The data engineer created an Amazon S3 bucket. The engineer
subscribed the S3 bucket to an AWS Data Exchange data product for general economic
indicators. The data engineer wants to join the economic indicator data to an existing table
in Amazon Athena to merge with the business data. All these transformations must finish
running in 30-60 minutes.
Which solution will meet these requirements MOST cost-effectively?

A. Configure the AWS Data Exchange product as a producer for an Amazon Kinesis datastream. Use an Amazon Kinesis Data Firehose delivery stream to transfer the data toAmazon S3 Run an AWS Glue job that will merge the existing business data with theAthena table. Write the result set back to Amazon S3.
B. Use an S3 event on the AWS Data Exchange S3 bucket to invoke an AWS Lambdafunction. Program the Lambda function to use Amazon SageMaker Data Wrangler tomerge the existing business data with the Athena table. Write the result set back toAmazon S3.
C. Use an S3 event on the AWS Data Exchange S3 bucket to invoke an AWS LambdaFunction Program the Lambda function to run an AWS Glue job that will merge the existingbusiness data with the Athena table Write the results back to Amazon S3.
D. Provision an Amazon Redshift cluster. Subscribe to the AWS Data Exchange productand use the product to create an Amazon Redshift Table Merge the data in AmazonRedshift. Write the results back to Amazon S3.


ANSWER : B



MLS-C01 Sample Question 36


A machine learning (ML) specialist is using Amazon SageMaker hyperparameter
optimization (HPO) to improve a model’s accuracy. The learning rate parameter is specified
in the following HPO configuration:


During the results analysis, the ML specialist determines that most of the training jobs had
a learning rate between 0.01 and 0.1. The best result had a learning rate of less than 0.01.
Training jobs need to run regularly over a changing dataset. The ML specialist needs to
find a tuning mechanism that uses different learning rates more evenly from the provided
range between MinValue and MaxValue.
Which solution provides the MOST accurate result?

A.Modify the HPO configuration as follows: Select the most accurate hyperparameter configuration form this HPO job.
B.Run three different HPO jobs that use different learning rates form the following intervalsfor MinValue and MaxValue while using the same number of training jobs for each HPOjob:[0.01, 0.1][0.001, 0.01][0.0001, 0.001]Select the most accurate hyperparameter configuration form these three HPO jobs.
C.Modify the HPO configuration as follows: Select the most accurate hyperparameter configuration form this training job.
D.Run three different HPO jobs that use different learning rates form the following intervalsfor MinValue and MaxValue. Divide the number of training jobs for each HPO job by three:[0.01, 0.1][0.001, 0.01][0.0001, 0.001]Select the most accurate hyperparameter configuration form these three HPO jobs.


ANSWER : C



MLS-C01 Sample Question 37


A retail company wants to build a recommendation system for the company's website. The
system needs to provide recommendations for existing users and needs to base those
recommendations on each user's past browsing history. The system also must filter out any
items that the user previously purchased.
Which solution will meet these requirements with the LEAST development effort?

A. Train a model by using a user-based collaborative filtering algorithm on AmazonSageMaker. Host the model on a SageMaker real-time endpoint. Configure an Amazon APIGateway API and an AWS Lambda function to handle real-time inference requests that theweb application sends. Exclude the items that the user previously purchased from theresults before sending the results back to the web application.
B. Use an Amazon Personalize PERSONALIZED_RANKING recipe to train a model.Create a real-time filter to exclude items that the user previously purchased. Create anddeploy a campaign on Amazon Personalize. Use the GetPersonalizedRanking APIoperation to get the real-time recommendations.
C. Use an Amazon Personalize USER_ PERSONAL IZATION recipe to train a modelCreate a real-time filter to exclude items that the user previously purchased. Create anddeploy a campaign on Amazon Personalize. Use the GetRecommendations API operationto get the real-time recommendations.
D. Train a neural collaborative filtering model on Amazon SageMaker by using GPU instances. Host the model on a SageMaker real-time endpoint. Configure an Amazon APIGateway API and an AWS Lambda function to handle real-time inference requests that theweb application sends. Exclude the items that the user previously purchased from theresults before sending the results back to the web application.


ANSWER : C



MLS-C01 Sample Question 38


A company deployed a machine learning (ML) model on the company website to predict
real estate prices. Several months after deployment, an ML engineer notices that the
accuracy of the model has gradually decreased.
The ML engineer needs to improve the accuracy of the model. The engineer also needs to
receive notifications for any future performance issues.
Which solution will meet these requirements?

A. Perform incremental training to update the model. Activate Amazon SageMaker Model Monitor to detect model performance issues and to send notifications.
B. Use Amazon SageMaker Model Governance. Configure Model Governance toautomatically adjust model hyper para meters. Create a performance threshold alarm inAmazon CloudWatch to send notifications.
C. Use Amazon SageMaker Debugger with appropriate thresholds. Configure Debugger tosend Amazon CloudWatch alarms to alert the team Retrain the model by using only datafrom the previous several months.
D. Use only data from the previous several months to perform incremental training toupdate the model. Use Amazon SageMaker Model Monitor to detect model performanceissues and to send notifications.


ANSWER : A



MLS-C01 Sample Question 39


A company wants to detect credit card fraud. The company has observed that an average
of 2% of credit card transactions are fraudulent. A data scientist trains a classifier on a
year's worth of credit card transaction data. The classifier needs to identify the fraudulent
transactions. The company wants to accurately capture as many fraudulent transactions as
possible.
Which metrics should the data scientist use to optimize the classifier? (Select TWO.)

A. Specificity
B. False positive rate
C. Accuracy
D. Fl score
E. True positive rate


ANSWER : D,E



MLS-C01 Sample Question 40


An insurance company developed a new experimental machine learning (ML) model to
replace an existing model that is in production. The company must validate the quality of
predictions from the new experimental model in a production environment before the
company uses the new experimental model to serve general user requests.
Which one model can serve user requests at a time. The company must measure the
performance of the new experimental model without affecting the current live traffic
Which solution will meet these requirements?

A. A/B testing
B. Canary release
C. Shadow deployment
D. Blue/green deployment


ANSWER : C



MLS-C01 Sample Question 41


A university wants to develop a targeted recruitment strategy to increase new student
enrollment. A data scientist gathers information about the academic performance history of
students. The data scientist wants to use the data to build student profiles. The university
will use the profiles to direct resources to recruit students who are likely to enroll in the
university.
Which combination of steps should the data scientist take to predict whether a particular
student applicant is likely to enroll in the university? (Select TWO)

A. Use Amazon SageMaker Ground Truth to sort the data into two groups named"enrolled" or "not enrolled."
B. Use a forecasting algorithm to run predictions.
C. Use a regression algorithm to run predictions.
D. Use a classification algorithm to run predictions
E. Use the built-in Amazon SageMaker k-means algorithm to cluster the data into twogroups named "enrolled" or "not enrolled."


ANSWER : A,D



MLS-C01 Sample Question 42


A car company is developing a machine learning solution to detect whether a car is present
in an image. The image dataset consists of one million images. Each image in the dataset
is 200 pixels in height by 200 pixels in width. Each image is labeled as either having a car
or not having a car.
Which architecture is MOST likely to produce a model that detects whether a car is present
in an image with the highest accuracy?

A. Use a deep convolutional neural network (CNN) classifier with the images as input.Include a linear output layer that outputs the probability that an image contains a car.
B. Use a deep convolutional neural network (CNN) classifier with the images as input.Include a softmax output layer that outputs the probability that an image contains a car.
C. Use a deep multilayer perceptron (MLP) classifier with the images as input. Include alinear output layer that outputs the probability that an image contains a car.
D. Use a deep multilayer perceptron (MLP) classifier with the images as input. Include asoftmax output layer that outputs the probability that an image contains a car.


ANSWER : A



MLS-C01 Sample Question 43


A data scientist is building a linear regression model. The scientist inspects the dataset and
notices that the mode of the distribution is lower than the median, and the median is lower
than the mean.
Which data transformation will give the data scientist the ability to apply a linear regression
model?

A. Exponential transformation
B. Logarithmic transformation
C. Polynomial transformation
D. Sinusoidal transformation


ANSWER : B



MLS-C01 Sample Question 44


A network security vendor needs to ingest telemetry data from thousands of endpoints that
run all over the world. The data is transmitted every 30 seconds in the form of records that
contain 50 fields. Each record is up to 1 KB in size. The security vendor uses Amazon
Kinesis Data Streams to ingest the data. The vendor requires hourly summaries of the
records that Kinesis Data Streams ingests. The vendor will use Amazon Athena to query
the records and to generate the summaries. The Athena queries will target 7 to 12 of the
available data fields.
Which solution will meet these requirements with the LEAST amount of customization to
transform and store the ingested data?

A. Use AWS Lambda to read and aggregate the data hourly. Transform the data and storeit in Amazon S3 by using Amazon Kinesis Data Firehose.
B. Use Amazon Kinesis Data Firehose to read and aggregate the data hourly. Transformthe data and store it in Amazon S3 by using a short-lived Amazon EMR cluster.
C. Use Amazon Kinesis Data Analytics to read and aggregate the data hourly. Transformthe data and store it in Amazon S3 by using Amazon Kinesis Data Firehose.
D. Use Amazon Kinesis Data Firehose to read and aggregate the data hourly. Transform the data and store it in Amazon S3 by using AWS Lambda.


ANSWER : C



MLS-C01 Sample Question 45


A machine learning (ML) engineer has created a feature repository in Amazon SageMaker
Feature Store for the company. The company has AWS accounts for development,
integration, and production. The company hosts a feature store in the development
account. The company uses Amazon S3 buckets to store feature values offline. The
company wants to share features and to allow the integration account and the production
account to reuse the features that are in the feature repository. Which combination of steps will meet these requirements? (Select TWO.)

A. Create an IAM role in the development account that the integration account andproduction account can assume. Attach IAM policies to the role that allow access to thefeature repository and the S3 buckets.
B. Share the feature repository that is associated the S3 buckets from the developmentaccount to the integration account and the production account by using AWS ResourceAccess Manager (AWS RAM).
C. Use AWS Security Token Service (AWS STS) from the integration account and theproduction account to retrieve credentials for the development account.
D. Set up S3 replication between the development S3 buckets and the integration andproduction S3 buckets.
E. Create an AWS PrivateLink endpoint in the development account for SageMaker.


ANSWER : A,B



MLS-C01 Sample Question 46


An Amazon SageMaker notebook instance is launched into Amazon VPC The SageMaker
notebook references data contained in an Amazon S3 bucket in another account The
bucket is encrypted using SSE-KMS The instance returns an access denied error when
trying to access data in Amazon S3.
Which of the following are required to access the bucket and avoid the access denied
error? (Select THREE)

A. An AWS KMS key policy that allows access to the customer master key (CMK)
B. A SageMaker notebook security group that allows access to Amazon S3
C. An 1AM role that allows access to the specific S3 bucket
D. A permissive S3 bucket policy
E. An S3 bucket owner that matches the notebook owner
F. A SegaMaker notebook subnet ACL that allow traffic to Amazon S3.


ANSWER : A,B,C