Introduction

This is a high-level diagram of our solution, detailing the primary modules and the steps involved in processing and accessing the information.

Platform Architecture Overview

https://files.gitbook.com/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FKRh36hIPzTrkxAkr2cdj%2Fuploads%2FRbtZ80u0KtbdtjeWQoaq%2Farchitecture.png?alt=media&token=ee6c1970-28f0-4b9b-a72e-e134e00c1fca

AI Data Processor Module: This module is responsible for interpreting the uploaded files and extracting the relevant information. The type of information extracted depends on the file type; currently, we support utility invoices exclusively. Our development efforts are focused on expanding capabilities to include certificates, purchase orders, invoices, and more complex documents.

Data Validation & Heuristic Module: This module executes various heuristics and validations on the information extracted from documents. Its primary objective is to ascertain the accuracy of the information and maintain consistency across all processed documents.

Auditable Storage Module: This module manages the storage of all processed information. It creates a data point that encompasses the structured information extracted and the original document from which this information was sourced. At the moment, we handle utility data points that encompass electricity, gas, and water utilities. Any alterations to a data point are meticulously tracked to document what changes were made to the information and who made them. Optionally, we can leverage blockchain technology to enable public auditing of this information.

Audit View: Data Points are accessible via our Document AI API or through a link that generates an HTML page. This feature is particularly beneficial for users who wish to make the information available for auditing purposes. The Audit View displays the entire change history of a data point, along with the original document.

Infrastructure

Our infrastructure is divided in 2 main components: