Data Ecosystem for Real-Time Analytics - Part 1

Data Ecosystem for Real-Time Analytics - Part 1

In a startup, the most exciting and challenging thing is to constantly evaluate the technology and business trend and predict the development direction of the industry based on our understanding and intuition. We keep exploring on - Why do we need Real-Time Analytics? What is event stream processing? Why do we need a streaming database? Can stream processing really replace batch processing?

In this article, we will focus on fundamental building blocks of a data ecosystem that can support Real-Time Analytics.

1.       Event streaming: Streaming data is data that is continuously generated and delivered rather than processed in batches or micro-batches. This is often referred to as “event data,” since each data point describes something that occurred at a given time.

The systems capture data in real-time from sources (or Producers) like databases, sensors, and cloud services in the form of event streams and deliver them to other applications, databases, and services (or Producers). It pertains to data storage, and stores data based on a timestamp. This is often handled by technology such as Apache Kafka and Amazon Kinesis.

In-memory stream processors stand out because of their ability to process large amounts of streaming data very quick. These systems can scale (Apache Kafka at LinkedIn supports over 7 trillion messages a day) and handle multiple, concurrent data sources and event streaming in real-time.


No alt text provided for this image
Illustration Credit : Eric Broda


 2.       Event Stream Processing: It is the practice of taking action on a series of data points that originate from a system that continuously creates data. Traditional Batch processing is about taking action on a large set of static data (“data at rest”), while Event Stream processing is about taking action on a constant flow of data (“data in motion”).

Since events are also referred to as messages, the entire system can also be referred to as a “messaging system”. These are usually the technology that helps developers write applications that take action on the events. Actions that are taken on those events include

  1. Aggregations (e.g., calculations such as sum, mean, standard deviation),
  2. Transformations (e.g., changing a number into a date format),
  3. Enrichment (e.g., combining the data point with other data sources to create more context and meaning),
  4. Analytics (e.g., predicting a future event based on patterns in the data),
  5. Ingestion (e.g., inserting the data into a database).

Use cases such as payment processing, fraud detection, anomaly detection, predictive maintenance, and IoT analytics all rely on immediate action on data. All of these use cases deal with data points in a continuous stream, each associated with a specific point in time.

Event stream processing is also valuable when data granularity is critical. The practice of Change Data Capture (CDC), in which all individual changes to a database are tracked, is another event stream processing use case. In CDC, downstream systems can use the stream of individual updates to a database for purposes such as identifying usage patterns that can help define optimization strategies, as well as tracking changes for auditing requirements.

Part 2 - will include Real-Time Analytic Database, Zero Trust Security, Testing, Architecture Patterns

Kovilur Gopala Krishnan

Tech Keynote Speaker | TOGAF Enterprise Architect | Artificial Intelligence | Robotics Learner | Data Science | Salesforce Certified-Agentforce & Data Cloud | Analytics | Digital Transformation | IT Governance | DevOps

1y

Great Article Padma Purushothaman. In Salesforce landscape and platform, CDC and Event Bus patterns are extensively used. Further, for a rollback/callback feature, Mulesoft(IPaaS) is clubbed with it for a robust enterprise solution.

Like
Reply

To view or add a comment, sign in

More articles by Padma Purushothaman

  • Govern your Data Migrations

    Is your data migration turning into a 👀 horror movie with unexpected twists and turns? Data migration, while promising…

    7 Comments
  • Are you new to run a Data Governance Program?

    You have got the buy-in to lead Data Governance program, what’s next? You can celebrate with champagne toasts 🍾, but…

  • Outdated Operations on Corporate Giants – Part 3

    Is your strategy to overcome outdated operations collecting dust on the shelf while execution struggles to find its…

    6 Comments
  • Data, Danger, Dollars

    Imagine a 👺thief, but instead of jewels, they crave 💰 data. Your precious business insights, the lifeblood of your…

    3 Comments
  • The Impact of Outdated Operations on Corporate Giants – Part 2

    Overcoming outdated operations in corporate giants is a complex challenge, but it's crucial for them to remain…

    5 Comments
  • The Impact of Outdated Operations on Corporate Giants - Part 1

    Numerous companies have faced challenges due to outdated business operations impacting their ability to adapt to…

    6 Comments
  • Fostering Critical Thinking Skills

    Are you business owner or a manager who wants to 🎭improve critical thinking skills in your company or in your teams ❓?…

    3 Comments
  • Graph Data Modeling

    Organizations today collect a large amount of data from ♨many different sources. However, raw data is not enough.

    1 Comment
  • Knowledge Graphs in Action

    Knowledge graphs could be used in organizations to 🔓unravel the intricacies of their own business processes…

    2 Comments
  • Knowledge Graph -Part 2

    Effective business decision-making requires organizations to constantly collect, process, and act on tons of data. Yet,…

    2 Comments

Insights from the community

Others also viewed

Explore topics