A nancial services company needs to aggregate daily stock trade data from the exchanges into a data store. The company requires that data be streamed directly into the data store, but also occasionally allows data to be modi ed using SQL. The solution should integrate complex, analytic queries running with minimal latency. The solution must provide a business intelligence dashboard that enables viewing of the top contributors to anomalies in stock prices.Which solution meets the company's requirements?
Answer(s): C
Using Kinesis Data Firehose to load into Redshift provides near-real-time streaming into a data warehouse with SQL analytics and low-latency BI dashboards via QuickSight. Redshift supports complex analytic queries with high performance, satisfying both streaming ingestion and SQL modification needs, plus BI capabilities for anomaly contributor dashboards.A) Firehose to S3 with Athena/QuickSight: stores data in S3 and queries with Athena; not a data warehouse with low-latency analytical queries or seamless SQL updates in the store.B) Kinesis Data Streams to Redshift: slightly more complex setup; Firehose is preferred for managed delivery into Redshift with simpler maintenance.D) Data Streams to S3 and Athena: lacks a data warehouse layer and direct streaming into a query-optimized store, increasing latency for analytics.
A nancial company hosts a data lake in Amazon S3 and a data warehouse on an Amazon Redshift cluster. The company uses Amazon QuickSight to build dashboards and wants to secure access from its on-premises Active Directory to Amazon QuickSight.How should the data be secured?
Answer(s): A
A) Using an Active Directory connector and single sign-on (SSO) in a corporate network environment. DAS-C01 emphasizes integrating on-premises identity with AWS services. AD Connector enables federated SSO to QuickSight, simplifying secure access from on-prem AD. Other options are incorrect: B) SSO alone via IAM roles does not address AD integration; Redshift authentication via IAM lacks AD federation. C) S3 and VPC endpoints do not provide AD-backed SSO for QuickSight access. D) Security groups and S3 endpoint alone do not establish AD-based authentication or SSO to QuickSight; requires identity federation rather than network controls.
A real estate company has a mission-critical application using Apache HBase in Amazon EMR. Amazon EMR is con gured with a single master node. The company has over 5 TB of data stored on an Hadoop Distributed File System (HDFS). The company wants a cost-effective solution to make its HBase data highly available.Which architectural pattern meets company's requirements?
Answer(s): D
A robust high-availability pattern for HBase on EMR is to deploy a primary multi-master HBase cluster and a separate read-replica cluster in another AZ, both backed by EMRFS with consistent view and using the same S3-based root. This enables failover and read-scale redundancy without duplicating write paths, while cost-effectively leveraging S3 as durable storage.A) Uses spot instances and multiple masters but mentions automated snapshots; does not provide a dedicated read-replica architecture and cross-AZ resilience for HBase writes. B) EMRFS with S3 root for HBase is not sufficient for high availability of the HBase masters or writes. C) Two clusters in different AZs pointing to same HBase root does not specify a synchronized write-capable replica; may risk data consistency. D) Correct: primary plus read-replica in separate AZ with shared S3-backed root ensures HA and read scalability.
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hbase-s3.html
A software company hosts an application on AWS, and new features are released weekly. As part of the application testing process, a solution must be developed that analyzes logs from each Amazon EC2 instance to ensure that the application is working as expected after each deployment. The collection and analysis solution should be highly available with the ability to display new information with minimal delays.Which method should the company use to collect and analyze the logs?
A) Incorrect: Detailed monitoring+CloudWatch agent→S3+Athena adds latency and is not as near-real-time as OpenSearch dashboards; S3/Athena are not optimized for continuous streaming analytics.B) Incorrect: KPL→Kinesis Data Streams→OpenSearch with QuickSight introduces extra visualization step and potential latency; QuickSight is not the typical near-real-time OpenSearch visualization path.C) Correct: KPL on EC2 sends to Kinesis Data Firehose, which streams to OpenSearch Service and OpenSearch Dashboards, providing low-latency, highly available log analytics with real-time visualization.D) Incorrect: CloudWatch Logs subscriptions to Kinesis Data Streams adds unnecessary hops; OpenSearch Dashboards are best paired with Firehose for direct ingestion.
https://docs.aws.amazon.com/AmazonCloudWatch/latest/logs/Subscriptions.html
A data analyst is using AWS Glue to organize, cleanse, validate, and format a 200 GB dataset. The data analyst triggered the job to run with the Standard worker type. After 3 hours, the AWS Glue job status is still RUNNING. Logs from the job run show no error codes. The data analyst wants to improve the job execution time without overprovisioning.Which actions should the data analyst take?
Answer(s): B
The correct answer is B. Enabling job metrics helps estimate DPUs required, and increasing the maximum capacity (DPUs) allows Glue to allocate more resources for parallelism, reducing overall runtime without overprovisioning when tuned to actual workload.A) Incorrect: Job bookmarks track data lineage and state, not resource estimation; increasing executor-cores is not the direct lever for Glue Spark parallelism since the recommended knob is DPUs (capacity).C) Incorrect: spark.yarn.executor.memoryOverhead is a Spark/YARN tuning parameter for driver/executor memory overhead, not the primary lever in Glue’s serverless/managed environment where DPUs determine capacity.D) Incorrect: Job bookmarks do not inform resource estimation, and num-executors is not a Glue parameter; Glue uses DPUs (capacity) rather than explicit executor counts.
https://docs.aws.amazon.com/glue/latest/dg/monitor-debug-capacity.html
A company has a business unit uploading .csv les to an Amazon S3 bucket. The company's data platform team has set up an AWS Glue crawler to do discovery, and create tables and schemas. An AWS Glue job writes processed data from the created tables to an Amazon Redshift database. The AWS Glue job handles column mapping and creating the Amazon Redshift table appropriately. When the AWS Glue job is rerun for any reason in a day, duplicate records are introduced into the Amazon Redshift table.Which solution will update the Redshift table without duplicates when jobs are rerun?
A) The staging-and-merge approach ensures idempotent loads: load into a staging table, then use postactions to upsert/replace into the main Redshift table, preventing duplicates on reruns. This aligns with ETL patterns for Redshift where you apply a controlled merge or replace of the target to maintain uniqueness.B) Introducing MySQL adds cross-system complexity and latency; Redshift is the target, and this approach is not idiomatic for AWS Glue/Redshift pipelines, plus it risks integrity and consistency.C) DataFrame.dropDuplicates() eliminates duplicates in Spark but does not guarantee deduplication against existing target rows in Redshift across reruns; it only affects in-memory/partitioned data before write.D) ResolveChoice selects a single schema type for conflicting columns, not deduplicating data or preventing repeated inserts into Redshift.
A streaming application is reading data from Amazon Kinesis Data Streams and immediately writing the data to an Amazon S3 bucket every 10 seconds. The application is reading data from hundreds of shards. The batch interval cannot be changed due to a separate requirement. The data is being accessed by AmazonAthena. Users are seeing degradation in query performance as time progresses.Which action can help improve query performance?
The correct answer is A: Merge the files in Amazon S3 to form larger files. Smaller, many small objects lead to suboptimal Athena query performance due to increased metadata handling and partition pruning overhead; larger files improve scan efficiency and reduce per-object overhead.B) Increasing Kinesis shards does not directly improve Athena query performance; it affects ingest parallelism, not how data is stored or scanned in S3.C) Adding memory/CPU to the streaming app does not influence how Athena reads S3 data or file layout.D) Writing to multiple S3 buckets adds cross-bucket query complexity and does not inherently enhance performance.
A company uses Amazon OpenSearch Service (Amazon Elasticsearch Service) to store and analyze its website clickstream data. The company ingests 1 TB of data daily using Amazon Kinesis Data Firehose and stores one day's worth of data in an Amazon ES cluster. The company has very slow query performance on the Amazon ES index and occasionally sees errors from Kinesis Data Firehose when attempting to write to the index. The Amazon ES cluster has 10 nodes running a single index and 3 dedicated master nodes. Each data node has 1.5 TB of Amazon EBS storage attached and the cluster is con gured with 1,000 shards. Occasionally, JVMMemoryPressure errors are found in the cluster logs.Which solution will improve the performance of Amazon ES?
The bottleneck is too many shards for the given data and node resources, causing high overhead, JVM memory pressure, and slower queries. Reducing shards improves search performance and reduces JVM GC pressure, making better use of available heap and I/O. C) Decrease the number of shards for the index aligns with best practices for small-to-moderate datasets per node and reduces overhead, improving query latency and stability. A) Increasing master node memory has negligible impact on data-node JVM pressure and query performance. B) Decreasing data nodes reduces parallelism and throughput, worsening latency. D) Increasing shards would worsen JVM pressure and overhead, not improve performance.
Share your comments for Amazon DAS-C01 exam with other users:
good questions
hi, could you please update the latest dump version
this question is keep repeat : you are developing a sales application that will contain several azure cloud services and handle different components of a transaction. different cloud services will process customer orders, billing, payment, inventory, and shipping. you need to recommend a solution to enable the cloud services to asynchronously communicate transaction information by using xml messages. what should you include in the recommendation?
great questions
its realy good
oracle 1z0-1059-22 dumps
please share me the pdf..
q50: which two functions can be used by an end user when pivoting an interactive report? the correct answer is a, c because we do not have rank in the function pivoting you can check in the apex app
best to practice
so far it is good
please provide me the dump
i failed the cisa exam today. but i have found all the questions that were on the exam to be on this site.
in question 272 the right answer states that an autonomous acces point is "configured and managed by the wlc" but this is not what i have learned in my ccna course. is this a mistake? i understand that lightweight aps are managed by wlc while autonomous work as standalones on the wlan.
it was helpful
good question
really nice
please i need dumps for isc2 cybersecuity
ans is coldline i think
very helpful
can you please provide dumps so that it helps me more
thank you for providing me with the updated question and answers. this version has all the questions from the exam. i just saw them in my exam this morning. i passed my exam today.
how i can see exam questions?
can you please upload please?
question 75: option c is correct answer
please add this exam
please upoad
has anyone recently attended safe 6.0 certification? is it the samq question from here.
expository experience
52 should be b&c. controller failure has nothing to do with this type of issue. degraded state tells us its a raid issue, and if the os is missing then the bootable device isnt found. the only other consideration could be data loss but thats somewhat broad whereas b&c show understanding of the specific issues the question is asking about.
great help!!!
very useful tools
looks a good platform to prepare az-104
want to pass the exam
good resource