This represents a high level architecture for a Media Video AI system. I call it the {A}rt of {R}eporting, {T}raining, {I}nferance, & {S}tate {T}racking or A.R.T.I.S.T. for short.
0. CI/CD: Jenkins Continuous integration & delivery of docker container environments.
- Capture & Prep: Captures Video and prepares it for processing by ML
- Model Training: Training of models to be used by Detectors.
- HITL & Data Collection DB: Transactional DB for managing data entry and labeling.
- HITL Team: Outsourced team to perform labeling and data entry.
- Job & State Mgmt: Job Management for scheduling and running ML tasks.
- Job & State Mgmt DB: Transactional DB for managing processes and states.
- Detectors: Inference Engines for detecting content in video/audio/text
- Videos & ML Results S3 Buckets: Video, Frames Audio, and ML Detection Storage.
- 3rd Party ML Services: Voice to Text, and other types of NLP or video detection service
- Audience Behavior & Ratings Warehouse: Storage and large volume processing warehouse DB
- 3rd Party Watch Log Providers: Watch Event log data providers & audience/critic panels
- Gallery Data Warehouse: Finale Data warehousing of Gallery for integration with other services
- Gallery DB Cache: Gallery data distributed across the world and localized to its common language
- Gallery UI: Public UI for customers to view the media Gallery
No comments:
Post a Comment