A company has a frontend ReactJS website that uses Amazon API Gateway to invoke REST APIs. The APIs perform the functionality of the website. A data engineer needs to write a Python script that can be occasionally invoked through API Gateway. The code must return results to API Gateway.Which solution will meet these requirements with the LEAST operational overhead?
Answer(s): B
A) The least operational overhead is achieved with a Lambda function that can be invoked by API Gateway and does not require managing servers or containers, with provisioned concurrency ensuring cold-start avoidance. B) Correct: AWS Lambda Python function with provisioned concurrency minimizes latency and management effort; API Gateway integration is direct, and no infrastructure provisioning is required. C) EKS adds substantial operational overhead for Kubernetes management and does not align with “least overhead” for a small script invoked via API Gateway. D) Regularly pinging a Lambda to stay warm is unnecessary with provisioned concurrency and adds unnecessary scheduling, increasing operational overhead and complexity.
A company has a production AWS account that runs company workloads. The company's security team created a security AWS account to store and analyze security logs from the production AWS account. The security logs in the production AWS account are stored in Amazon CloudWatch Logs.The company needs to use Amazon Kinesis Data Streams to deliver the security logs to the security AWS account.Which solution will meet these requirements?
Answer(s): D
Kinesis Data Streams must reside in the destination account (security account) where logs from CloudWatch Logs will be delivered via a subscription filter. Creating the destination stream in the security account and granting CloudWatch Logs permission to put records, with a subscription filter, aligns cross-account delivery without requiring cross-account Data Streams permissions in the source account.A) Wrong: destination stream in production requires cross-account Kinesis permissions; not aligned with central security account ownership.B) Wrong: subscription filter targets CloudWatch Logs to a Kinesis stream in security account, but missing cross-account trust for CloudWatch Logs in production; workflow incorrect.C) Wrong: destination stream in production; cross-account role in production to security is unnecessary and misaligned with security-centric design.D) Correct: destination stream in security account; IAM trust policy allows CloudWatch Logs to write; subscription filter enables delivery from production logs to the security account.
A company uses Amazon S3 to store semi-structured data in a transactional data lake. Some of the data files are small, but other data files are tens of terabytes.A data engineer must perform a change data capture (CDC) operation to identify changed data from the data source. The data source sends a full snapshot as a JSON file every day and ingests the changed data into the data lake.Which solution will capture the changed data MOST cost-effectively?
Answer(s): C
The correct answer is C because using an open source data lake format (such as Apache Iceberg or Delta Lake) enables ACID-compliant upserts/merges on a large-scale S3 data lake, allowing efficient CDC by merging daily full snapshots with existing data without heavy per-row processing or data movement. It minimizes storage and compute costs for tens of terabytes and small files, and supports scalable incremental updates.A) Lambda-based diffing on large datasets is prohibitively expensive and slow for multi-terabyte files.B) DMS with RDS MySQL adds relational DB maintenance and ongoing replication cost; CDC via DMS is not optimal for bulk S3 lake merging.D) Aurora Serverless with DMS adds database compute cost and complexity; not the most cost-effective for bulk lake merges.
A data engineer runs Amazon Athena queries on data that is in an Amazon S3 bucket. The Athena queries use AWS Glue Data Catalog as a metadata table.The data engineer notices that the Athena query plans are experiencing a performance bottleneck. The data engineer determines that the cause of the performance bottleneck is the large number of partitions that are in the S3 bucket. The data engineer must resolve the performance bottleneck and reduce Athena query planning time.Which solutions will meet these requirements? (Choose two.)
Answer(s): A,C
Athena planning is sped up by reducing partition discovery and enabling predicate pushdown through partition metadata, which Glue partition index and partition projection provide.A) Creates a Glue partition index and enables partition filtering to prune partitions at query planning time. B) Bucketing by a common column does not affect partition discovery or metadata pruning in Athena when using Glue Catalog; it mainly affects data layout for certain query engines but not partition pruning in this setup. C) Enables partition projection to avoid enumerating large numbers of partitions, speeding up planning by computing partitions from the S3 prefix without metadata lookups. D) Parquet formatting improves scan efficiency but does not directly reduce partition discovery or planning time; it affects I/O and scan cost rather than planning bottlenecks. E) S3DistCP merging objects reduces small object overhead but does not impact Athena’s partition planning or metadata pruning.
A data engineer must manage the ingestion of real-time streaming data into AWS. The data engineer wants to perform real-time analytics on the incoming streaming data by using time-based aggregations over a window of up to 30 minutes. The data engineer needs a solution that is highly fault tolerant.Which solution will meet these requirements with the LEAST operational overhead?
A concise, correct choice explanation:D) Fully managed Apache Flink service provides stateful, fault-tolerant stream processing with native support for time-based windows (up to 30 minutes) and multiple aggregations, minimizing operational overhead. It handles exactly-once semantics and scalable stateful processing for real-time analytics.A) Lambda with both business and analytics logic is operationally heavy and lacks built-in fault-tolerant streaming state management for long windows; potential reprocessing and state management increase overhead.B) Flink option is correct; the claim about occasional duplicates is not aligned with Flink’s strong exactly-once semantics and proper windowing.C) Lambda-based tumbling window on event time is less suitable for long-running, continuous aggregations and still requires complex orchestration and fault handling.
A company is planning to upgrade its Amazon Elastic Block Store (Amazon EBS) General Purpose SSD storage from gp2 to gp3. The company wants to prevent any interruptions in its Amazon EC2 instances that will cause data loss during the migration to the upgraded storage.Which solution will meet these requirements with the LEAST operational overhead?
GP3 supports in-place volume type conversion and allows adjusting IOPS/throughput without data migration, minimizing downtime and operational overhead.A) Incorrect: snapshots then create and attach new volumes introduces downtime during detachment/attachment and data consistency concerns; extra steps increase risk and cost.B) Incorrect: gradual data transfer requires coordination and can still affect availability; mounting changes can cause brief interruption.D) Incorrect: DataSync is unnecessary for block storage within the same volume; adds complexity and latency without reliability benefits for in-place upgrades.
A company is migrating its database servers from Amazon EC2 instances that run Microsoft SQL Server to Amazon RDS for Microsoft SQL Server DB instances. The company's analytics team must export large data elements every day until the migration is complete. The data elements are the result of SQL joins across multiple tables. The data must be in Apache Parquet format. The analytics team must store the data in Amazon S3.Which solution will meet these requirements in the MOST operationally efficient way?
A) Using Glue to read directly from a live view on EC2 is not straightforward since Glue typically crawls data stores for cataloging; accessing an on-premises or EC2-hosted SQL Server view would require a persistent connection and proper JDBC/ODBC setup, which adds operational overhead. B) SQL Server Agent on EC2 producing CSV to S3 and Lambda to convert adds unnecessary steps and potential ETL drift; managing two services increases operational burden. C) Creating a view, then using a Glue crawler to catalog the view and a Glue job to extract and convert to Parquet into S3 provides a fully managed, serverless, repeatable, and scalable workflow with minimal maintenance. D) Lambda with JDBC requires continuous connection management, cold start considerations, and scripting for Parquet conversion, which is less operationally efficient than Glue-based ETL.
A data engineering team is using an Amazon Redshift data warehouse for operational reporting. The team wants to prevent performance issues that might result from long- running queries. A data engineer must choose a system table in Amazon Redshift to record anomalies when a query optimizer identifies conditions that might indicate performance issues.Which table views should the data engineer use to meet this requirement?
A) STL_ALERT_EVENT_LOGB) STL_ALERT_EVENT_LOG is the system view that records anomalies and alerts detected by the query optimizer when potential performance issues are identified, making it the appropriate source for monitoring long-running or problematic queries. C) STL_QUERY_METRICS contains per-query metrics but does not specifically log anomalies identified by the optimizer. D) STL_PLAN_INFO provides plan details but not a centralized anomaly/alert log. A) STL_USAGE_CONTROL is related to usage controls and does not capture optimizer anomaly events.
Share your comments for Amazon Amazon-DEA-C01 exam with other users:
american history 1
good level of questions
i need this dump kindly upload it
do we need c# coding to be az204 certified
excellent topics covered
are these really financial cloud questions and answers, seems these are basic admin question and answers
are these comments real
please upload the latest dumps
a company runs its workloads on premises. the company wants to forecast the cost of running a large application on aws. which aws service or tool can the company use to obtain this information? pricing calculator ... the aws pricing calculator is primarily used for estimating future costs
looks interesting
thanks! that’s amazing
the exam dumps are helping me get a solid foundation on the practical techniques and practices needed to be successful in the auditing world.
q 14 should be dmz sever1 and notepad.exe why does note pad have a 443 connection
question # 108, correct answers are business growth and risk reduction.
are these valid chfi questions
question: 162 should be dlp (b)
good exam questions
I have to say this is really close to real exam. Passed my exam with this.
good analytics question
this looks accurate
question 46, the answer should be data "virtualization" (not visualization).
its useful.
Pass this exam 3 days ago. The PDF version and the Xengine App is quite useful.
informative for me.
question 134s answer shoule be "dlp"
in 72 the answer must be [sys_user_has_role] table.
i appreciated the mix of multiple-choice and short answer questions. i passed my exam this morning.
great to find this website, thanks
examination questions seem to be relevant.
planning to take psm test
please allow to download
please provide dumps
is the answer to question 15 correct ? i feel like the answer should be b
its getting more technical
Keeping this site free takes real effort. We constantly battle automated scraping and unauthorized content copying. A quick account helps us protect the community and keep the site free.
To continue studying for your Amazon-DEA-C01, please sign in or create a free account.