Amazon AWS Certified Machine Learning Engineer - Associate MLA-C01 Exam (page: 2)
Amazon AWS Certified Machine Learning Engineer - Associate MLA-C01
Updated on: 12-Feb-2026

HOTSPOT

An ML engineer is working on an ML model to predict the prices of similarly sized homes. The model will base predictions on several features The ML engineer will use the following feature engineering techniques to estimate the prices of the homes:

· Feature splitting
· Logarithmic transformation
· One-hot encoding
· Standardized distribution

Select the correct feature engineering techniques for the following list of features. Each feature engineering technique should be selected one time or not at all.

Hot Area:

  1. See Explanation section for answer.

Answer(s): A

Explanation:




The correct feature engineering techniques for each feature are:
13. City (name) - One-hot encoding
The city name is a categorical feature, so one-hot encoding is used to convert it into a binary vector representation for the model.
14. Type_year (type of home and year the home was built) - Feature splitting This combined feature can be split into two separate features: "type of home" and "year the home was built," for more meaningful analysis.
15. Size of the building (square feet or square meters) - Logarithmic transformation Logarithmic transformation can be applied to normalize the distribution if the size has a skewed distribution.



Case study

An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.

The dataset has a class imbalance that affects the learning of the model's algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.

Which AWS service or feature can aggregate the data from the various data sources?

  1. Amazon EMR Spark jobs
  2. Amazon Kinesis Data Streams
  3. Amazon DynamoDB
  4. AWS Lake Formation

Answer(s): D



Case study

An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.

The dataset has a class imbalance that affects the learning of the model's algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.

After the data is aggregated, the ML engineer must implement a solution to automatically detect anomalies in the data and to visualize the result.

Which solution will meet these requirements?

  1. Use Amazon Athena to automatically detect the anomalies and to visualize the result.
  2. Use Amazon Redshift Spectrum to automatically detect the anomalies. Use Amazon QuickSight to visualize the result.
  3. Use Amazon SageMaker Data Wrangler to automatically detect the anomalies and to visualize the result.
  4. Use AWS Batch to automatically detect the anomalies. Use Amazon QuickSight to visualize the result.

Answer(s): C

Explanation:

Amazon SageMaker Data Wrangler is designed to preprocess, analyze, and visualize data efficiently. It provides built-in tools for anomaly detection, allowing the ML engineer to automatically identify anomalies in the dataset. Additionally, SageMaker Data Wrangler includes visualization capabilities to explore the data and results, meeting the requirements for anomaly detection and visualization in one integrated environment.



Case study

An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.

The dataset has a class imbalance that affects the learning of the model's algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.

The training dataset includes categorical data and numerical data. The ML engineer must prepare the training dataset to maximize the accuracy of the model.

Which action will meet this requirement with the LEAST operational overhead?

  1. Use AWS Glue to transform the categorical data into numerical data.
  2. Use AWS Glue to transform the numerical data into categorical data.
  3. Use Amazon SageMaker Data Wrangler to transform the categorical data into numerical data.
  4. Use Amazon SageMaker Data Wrangler to transform the numerical data into categorical data.

Answer(s): C

Explanation:

Transforming categorical data into numerical data is essential for ML models that require numerical input, as it allows the algorithm to process the categorical information effectively. Amazon SageMaker Data Wrangler provides an intuitive interface for data preparation, including built-in transformations like one-hot encoding and label encoding for categorical data. Using SageMaker Data Wrangler reduces operational overhead by offering an integrated environment to preprocess data without needing to write extensive code.



Case study

An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.

The dataset has a class imbalance that affects the learning of the model's algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.

Before the ML engineer trains the model, the ML engineer must resolve the issue of the imbalanced data.

Which solution will meet this requirement with the LEAST operational effort?

  1. Use Amazon Athena to identify patterns that contribute to the imbalance. Adjust the dataset accordingly.
  2. Use Amazon SageMaker Studio Classic built-in algorithms to process the imbalanced dataset.
  3. Use AWS Glue DataBrew built-in features to oversample the minority class.
  4. Use the Amazon SageMaker Data Wrangler balance data operation to oversample the minority class.

Answer(s): D

Explanation:

The Amazon SageMaker Data Wrangler balance data operation provides a built-in capability to handle class imbalance by oversampling the minority class or undersampling the majority class. This solution minimizes operational effort by offering an integrated, no-code/low-code approach to address the imbalance directly within SageMaker's data preparation workflow. It ensures that the dataset is balanced, improving the performance of the ML model.



Case study

An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.

The dataset has a class imbalance that affects the learning of the model's algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.

The ML engineer needs to use an Amazon SageMaker built-in algorithm to train the model.

Which algorithm should the ML engineer use to meet this requirement?

  1. LightGBM
  2. Linear learner
  3. K-means clustering
  4. Neural Topic Model (NTM)

Answer(s): A



A company has deployed an XGBoost prediction model in production to predict if a customer is likely to cancel a subscription. The company uses Amazon SageMaker Model Monitor to detect deviations in the F1 score.

During a baseline analysis of model quality, the company recorded a threshold for the F1 score. After several months of no change, the model's F1 score decreases significantly.

What could be the reason for the reduced F1 score?

  1. Concept drift occurred in the underlying customer data that was used for predictions.
  2. The model was not sufficiently complex to capture all the patterns in the original baseline data.
  3. The original baseline data had a data quality issue of missing values.
  4. Incorrect ground truth labels were provided to Model Monitor during the calculation of the baseline.

Answer(s): A

Explanation:

Concept drift occurs when the statistical properties of the data change over time, meaning the relationship between input features and the target variable in the production data differs from the data used during model training. This is a common reason for the degradation of a model's performance metrics, such as the F1 score, over time. In this case, changes in customer behavior or other external factors could cause the predictions to deviate from the actual outcomes, leading to a drop in the F1 score.



A company has a team of data scientists who use Amazon SageMaker AI notebook instances to test ML models.
When the data scientists need new permissions, the company attaches the permissions to each individual role that was created during the creation of the SageMaker AI notebook instance.

The company needs to centralize management of the team's permissions.

Which solution will meet this requirement?

  1. Create a single IAM role that has the necessary permissions. Attach the role to each notebook instance that the team uses.
  2. Create a single IAM group. Add the data scientists to the group. Associate the group with each notebook instance that the team uses.
  3. Create a single IAM user. Attach the AdministratorAccess AWS managed IAM policy to the user. Configure each notebook instance to use the IAM user.
  4. Create a single IAM group. Add the data scientists to the group. Create an IAM role. Attach the AdministratorAccess AWS managed IAM policy to the role. Associate the role with the group. Associate the group with each notebook instance that the team uses.

Answer(s): A

Explanation:

By creating a single IAM role with the required permissions and attaching it to each SageMaker notebook instance, the company can centralize permission management. This solution ensures that all notebook instances share the same permissions, eliminating the need to manage permissions individually for each instance or user. It aligns with AWS best practices for role-based access control and reduces operational overhead.



Viewing Page 2 of 30



Share your comments for Amazon AWS Certified Machine Learning Engineer - Associate MLA-C01 exam with other users:

Neha 9/28/2023 1:58:00 PM

its helpful
Anonymous


Desmond 1/5/2023 9:11:00 PM

i just took my oracle exam and let me tell you, this exam dumps was a lifesaver! without them, iam not sure i would have passed. the questions were tricky and the answers were obscure, but the exam dumps had everything i needed. i would recommend to anyone looking to pass their oracle exams with flying colors (and a little bit of cheating) lol.
SINGAPORE


Davidson OZ 9/9/2023 6:37:00 PM

22. if you need to make sure that one computer in your hot-spot network can access the internet without hot-spot authentication, which menu allows you to do this? answer is ip binding and not wall garden. wall garden allows specified websites to be accessed with users authentication to the hotspot
Anonymous


381 9/2/2023 4:31:00 PM

is question 1 correct?
Anonymous


Laurent 10/6/2023 5:09:00 PM

good content
Anonymous


Sniper69 5/9/2022 11:04:00 PM

manged to pass the exam with this exam dumps.
UNITED STATES


Deepak 12/27/2023 2:37:00 AM

good questions
SINGAPORE


dba 9/23/2023 3:10:00 AM

can we please have the latest exam questions?
Anonymous


Prasad 9/29/2023 7:27:00 AM

please help with jn0-649 latest dumps
HONG KONG


GTI9982 7/31/2023 10:15:00 PM

please i need this dump. thanks
CANADA


Elton Riva 12/12/2023 8:20:00 PM

i have to take the aws certified developer - associate dva-c02 in the next few weeks and i wanted to know if the questions on your website are the same as the official exam.
Anonymous


Berihun Desalegn Wonde 7/13/2023 11:00:00 AM

all questions are more important
Anonymous


gr 7/2/2023 7:03:00 AM

ques 4 answer should be c ie automatically recover from failure
Anonymous


RS 7/27/2023 7:17:00 AM

very very useful page
INDIA


Blessious Phiri 8/12/2023 11:47:00 AM

the exams are giving me an eye opener
Anonymous


AD 10/22/2023 9:08:00 AM

3rd so far, need to cover more
Anonymous


Matt 11/18/2023 2:32:00 AM

aligns with the pecd notes
Anonymous


Sri 10/15/2023 4:38:00 PM

question 4: b securityadmin is the correct answer. https://docs.snowflake.com/en/user-guide/security-access-control-overview#access-control-framework
GERMANY


H.T.M. D 6/25/2023 2:55:00 PM

kindly please share dumps
Anonymous


Satish 11/6/2023 4:27:00 AM

it is very useful, thank you
Anonymous


Chinna 7/30/2023 8:37:00 AM

need safe rte dumps
FRANCE


1234 6/30/2023 3:40:00 AM

can you upload the cis - cpg dumps
Anonymous


Did 1/12/2024 3:01:00 AM

q6 = 1. download odt application 2. create a configuration file (xml) 3. setup.exe /download to download the installation files 4. setup.exe /configure to deploy the application
FRANCE


John 10/12/2023 12:30:00 PM

great material
Anonymous


Dinesh 8/1/2023 2:26:00 PM

could you please upload sap c_arsor_2302 questions? it will be very much helpful.
Anonymous


LBert 6/19/2023 10:23:00 AM

vraag 20c: rsa veilig voor symmtrische cryptografie? antwoord c is toch fout. rsa is voor asymmetrische cryptogafie??
NETHERLANDS


g 12/22/2023 1:51:00 PM

so far good
UNITED STATES


Milos 8/4/2023 9:33:00 AM

question 31 has obviously wrong answers. tls and ssl are used to encrypt data at transit, not at rest.
Serbia And Montenegro


Diksha 9/25/2023 2:32:00 AM

pls provide dump for 1z0-1080-23 planning exams
Anonymous


H 7/17/2023 4:28:00 AM

could you please upload the exam?
Anonymous


Anonymous 9/14/2023 4:47:00 AM

please upload this
UNITED STATES


Naveena 1/13/2024 9:55:00 AM

good material
Anonymous


WildWilly 1/19/2024 10:43:00 AM

lets see if this is good stuff...
Anonymous


Lavanya 11/2/2023 1:53:00 AM

useful information
UNITED STATES