Insights Data Schemas
External data pipeline
Whole data flow
An event announcing new data from the Insights Operator is consumed from Kafka. Among other things, the event contains a URL pointing to the S3 bucket.
The Insights Operator data is read from the S3 bucket, and Insights rules are applied to it.
The results (essentially organization ID + account number + cluster name + Insights results JSON) are written back to Kafka, but to a different topic.
Those results are consumed by the Insights rules aggregator service, which caches them.
The service exposes this data via a REST API to other tools, such as the OpenShift Cluster Manager web UI, the OpenShift console, etc.
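The flow above can be sketched in a few lines of Python. This is only an illustrative model, not the actual implementation: Kafka and S3 are replaced by plain function arguments, and every name in it (`handle_announcement`, `Aggregator`, and the fields `url`, `org_id`, `account`, `cluster_name`, `report`) is an assumption chosen for the example. The real message layouts are described in the data format documents listed below.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class Result:
    """Hypothetical shape of a message written to the results topic."""
    org_id: int
    account_number: int
    cluster_name: str
    report: dict


def handle_announcement(raw_event, fetch, apply_rules):
    """Decode the event, fetch the data it points to, and run the rules.

    `fetch` stands in for the S3 download and `apply_rules` for the
    rules engine; both are supplied by the caller.
    """
    event = json.loads(raw_event)
    archive = fetch(event["url"])  # the event carries the S3 URL
    report = apply_rules(archive)
    result = Result(
        org_id=event["org_id"],
        account_number=event["account"],
        cluster_name=event["cluster_name"],
        report=report,
    )
    # The serialized result is what would be produced to the second topic.
    return json.dumps(asdict(result))


class Aggregator:
    """Caches consumed results and returns them on request."""

    def __init__(self):
        self._cache = {}

    def consume(self, raw_result):
        result = json.loads(raw_result)
        key = (result["org_id"], result["cluster_name"])
        self._cache[key] = result["report"]

    def get_report(self, org_id, cluster_name):
        # In the real service this lookup would sit behind a REST endpoint.
        return self._cache.get((org_id, cluster_name))
```

Passing the string returned by `handle_announcement` to `Aggregator.consume` mimics the hop through the second Kafka topic, and `get_report` plays the role of the REST API that the web UI and console would call.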
Architecture diagram
Data format descriptions
Incoming messages in platform.upload.announce
Raw data stored in S3 bucket
Data produced by OCP rules engine
Messages consumed from ccx.ocp.results topic
OCP results written into RDS
Rule content produced by Content Service
Rule content consumed by Content Service