CPC G06F 16/288 (2019.01) [G06F 16/24556 (2019.01); G06F 16/27 (2019.01); G06F 16/273 (2019.01); G06F 16/275 (2019.01); G06F 16/278 (2019.01); G06N 5/04 (2013.01)] | 20 Claims |
1. A method for managing data collection in a distributed system where data is collected in a data aggregator of the distributed system and from a data collector of the distributed system that is operably connected to the data aggregator via a communication system, the method comprising:
obtaining, by the data aggregator, a data set for the data collector;
obtaining, by the data aggregator and using the data set, a feature relationship model comprising causal relationships between features of the data set;
selecting, by the data aggregator and using the feature relationship model, a data reduction plan based on acceptable error thresholds associated with the features;
configuring, by the data aggregator, the data collector to send reduced size data based on the data reduction plan;
obtaining, by the data aggregator, reduced size data from the configured data collector; and
reconstructing, by the data aggregator, data upon which the reduced size data is based using the feature relationship model.
|
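The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: it assumes, purely for illustration, that each relationship in the feature relationship model is approximated by a simple linear fit between two features (the claim recites causal relationships, which a linear fit does not establish), and that the data reduction plan consists of omitting any feature whose fitted error stays within its acceptable error threshold. All names (`LinearLink`, `fit_link`, `select_plan`, `Collector`, `reconstruct`) and the sample feature names and values are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class LinearLink:
    """Hypothetical stand-in for one relationship: target ~ slope * source + intercept."""
    source: str
    target: str
    slope: float
    intercept: float
    max_error: float  # worst absolute error observed on the initial data set


def fit_link(rows, source, target):
    """Aggregator side: least-squares fit of target on source (assumed model form)."""
    xs = [r[source] for r in rows]
    ys = [r[target] for r in rows]
    mx, my = mean(xs), mean(ys)
    var = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / var if var else 0.0
    intercept = my - slope * mx
    max_error = max(abs(y - (slope * x + intercept)) for x, y in zip(xs, ys))
    return LinearLink(source, target, slope, intercept, max_error)


def select_plan(links, thresholds):
    """Keep links whose target can be omitted within its acceptable error threshold."""
    plan, dropped, needed = [], set(), set()
    for link in links:
        within = link.max_error <= thresholds.get(link.target, 0.0)
        # Never drop a feature that another accepted link needs as its source.
        if within and link.source not in dropped and link.target not in needed | dropped:
            plan.append(link)
            dropped.add(link.target)
            needed.add(link.source)
    return plan


class Collector:
    """Collector side: sends every feature until configured to omit some."""

    def __init__(self):
        self.omit = set()

    def configure(self, plan):
        self.omit = {link.target for link in plan}

    def send(self, row):
        return {k: v for k, v in row.items() if k not in self.omit}


def reconstruct(reduced, plan):
    """Aggregator side: re-derive omitted features from the ones that were sent."""
    full = dict(reduced)
    for link in plan:
        full[link.target] = link.slope * reduced[link.source] + link.intercept
    return full


# Illustrative data set obtained for the collector (values are made up).
rows = [
    {"temp_c": 20.0, "temp_f": 68.0, "load": 0.3},
    {"temp_c": 25.0, "temp_f": 77.0, "load": 0.9},
    {"temp_c": 30.0, "temp_f": 86.0, "load": 0.4},
]
links = [fit_link(rows, "temp_c", "temp_f"), fit_link(rows, "temp_c", "load")]
thresholds = {"temp_f": 0.5, "load": 0.05}  # acceptable error per feature

plan = select_plan(links, thresholds)
collector = Collector()
collector.configure(plan)

reduced = collector.send({"temp_c": 22.0, "temp_f": 71.6, "load": 0.5})
restored = reconstruct(reduced, plan)
```

In this sketch `temp_f` is omitted from transmission because it is recoverable from `temp_c` within its threshold, while `load` is still sent because the linear fit misses it by more than the allowed error. A real system would substitute whatever model and plan selection the specification actually describes.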