Build a data lake with Apache Flink on Amazon EMR
AWS Big Data Blog
by Jianwei Li
1d ago
To build a data-driven business, it is important to democratize enterprise data assets in a data catalog. With a unified data catalog, you can quickly search datasets and figure out data schema, data format, and location. The AWS Glue Data Catalog provides a uniform repository where disparate systems can store and find metadata to keep track of data in data silos. Apache Flink is a widely used data processing engine for scalable streaming ETL, analytics, and event-driven applications. It provides precise time and state management with fault tolerance. Flink can process bounded stream (batch) a ..read more
Visit website
Advanced reporting and analytics for the Post Call Analytics (PCA) solution with Amazon QuickSight
AWS Big Data Blog
by Ankur Taunk
1d ago
Organizations with contact centers benefit from advanced analytics on their call recordings to gain important product feedback, improve contact center efficiency, and identify coaching opportunities for their staff. The Post Call Analytics (PCA) solution uses AWS machine learning (ML) services like Amazon Transcribe and Amazon Comprehend to extract insights from contact center call audio recordings uploaded after the call, or from integration with our companion Live Call Analytics (LCA) solution. You can visualize the PCA insights in the business intelligence (BI) tool Amazon QuickSight for ad ..read more
Visit website
Diligent enhances customer governance with automated data-driven insights using Amazon QuickSight
AWS Big Data Blog
by Vidya Kotamraju
1d ago
This post is co-written with Vidya Kotamraju and Tallis Hobbs, from Diligent. Diligent is the global leader in modern governance, providing software as a service (SaaS) services across governance, risk, compliance, and audit, helping companies meet their environmental, social, and governance (ESG) commitments. Serving more than 1 million users from over 25,000 customers around the world, we empower transformational leaders with software, insights, and confidence to drive greater impact and lead with purpose. We provide the right governance technology that empowers our customers to act strategi ..read more
Visit website
Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 1: Getting Started
AWS Big Data Blog
by Akira Ajisaka
2d ago
AWS Glue is a serverless, scalable data integration service that makes it easier to discover, prepare, move, and integrate data from multiple sources. AWS Glue provides an extensible architecture that enables users with different data processing use cases. A common use case is building data lakes on Amazon Simple Storage Service (Amazon S3) using AWS Glue extract, transform, and load (ETL) jobs. Data lakes free you from proprietary data formats defined by the business intelligence (BI) tools and limited capacity of proprietary storage. In addition, data lakes help you break down data silos to ..read more
Visit website
Automate deployment and version updates for Amazon Kinesis Data Analytics applications with AWS CodePipeline
AWS Big Data Blog
by Anand Shah
2d ago
Amazon Kinesis Data Analytics is the easiest way to transform and analyze streaming data in real time using Apache Flink. Customers are already using Kinesis Data Analytics to perform real-time analytics on fast-moving data generated from data sources like IoT sensors, change data capture (CDC) events, gaming, social media, and many others. Apache Flink is a popular open-source framework and distributed processing engine for stateful computations over unbounded and bounded data streams. Although building Apache Flink applications is typically the responsibility of a data engineering team, auto ..read more
Visit website
Super-charged pivot tables in Amazon QuickSight
AWS Big Data Blog
by Bhupinder Chadha
3d ago
Amazon QuickSight is a fast and cloud-powered business intelligence (BI) service that makes it easy to create and deliver insights to everyone in your organization without any servers or infrastructure. QuickSight dashboards can also be embedded into applications and portals to deliver insights to external stakeholders. Additionally, with Amazon QuickSight Q, end-users can simply ask questions in natural language to get machine learning (ML)-powered visual responses to their questions. Recently, Amazon FinTech migrated all their financial reporting to QuickSight. This involved migrating comple ..read more
Visit website
Amazon OpenSearch Serverless is now generally available!
AWS Big Data Blog
by Pavani Baddepudi
3d ago
We ended 2022 on a high note with the preview release of Amazon OpenSearch Serverless at re:Invent. Today, we are happy to announce the general availability of Amazon OpenSearch Serverless, the serverless option for Amazon OpenSearch Service that makes it easier to run search and analytics workloads without even having to think about infrastructure management. In this post, we share our approach and high-level architecture of OpenSearch Serverless. Background Self-managed OpenSearch and managed OpenSearch Service are widely used to search and analyze petabytes of data. Both options give you fu ..read more
Visit website
How SikSin improved customer engagement with AWS Data Lab and Amazon Personalize
AWS Big Data Blog
by Byungjun Choi
3d ago
This post is co-written with Byungjun Choi and Sangha Yang from SikSin. SikSin is a technology platform connecting customers with restaurant partners serving their multiple needs. Customers use the SikSin platform to search and discover restaurants, read and write reviews, and view photos. From the restaurateurs’ perspective, SikSin enables restaurant partners to engage and acquire customers in order to grow their business. SikSin has a partnership with 850 corporate companies and more than 50,000 restaurants. They issue restaurant e-vouchers to more than 220,000 members, including individuals ..read more
Visit website
Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation
AWS Big Data Blog
by Vivek Shrivastava
4d ago
AWS Lake Formation helps with enterprise data governance and is important for a data mesh architecture. It works with the AWS Glue Data Catalog to enforce data access and governance. Both services provide reliable data storage, but some customers want replicated storage, catalog, and permissions for compliance purposes. This post explains how to create a design that automatically backs up Amazon Simple Storage Service (Amazon S3), the AWS Glue Data Catalog, and Lake Formation permissions in different Regions and provides backup and restore options for disaster recovery. These mechanisms can be ..read more
Visit website
Build a serverless analytics application with Amazon Redshift and Amazon API Gateway
AWS Big Data Blog
by David Zhang
4d ago
Serverless applications are a modernized way to perform analytics among business departments and engineering teams. Business teams can gain meaningful insights by simplifying their reporting through web applications and distributing it to a broader audience. Use cases can include the following: Dashboarding – A webpage consisting of tables and charts where each component can offer insights to a specific business department. Reporting and analysis – An application where you can trigger large analytical queries with dynamic inputs and then view or download the results. Management systems – An a ..read more
Visit website

Follow AWS Big Data Blog on Feedspot

Continue with Google
OR