CPC G06F 40/284 (2020.01) [G06F 16/3347 (2019.01); G06F 40/242 (2020.01); G06N 5/04 (2013.01); G06N 20/00 (2019.01)] | 20 Claims |
1. A computer-implemented method, comprising:
obtaining a log generated by one or more components in an information technology environment;
generating one or more tokens based on text in the log;
filtering the one or more tokens based on a language dictionary to identify a subset of the one or more tokens;
converting the subset of the one or more tokens into a vector that is labeled with an indication of a first log sourcetype; and
training, using the vector, a machine learning model to predict whether a log sourcetype of a log applied to the trained machine learning model as an input is the first log sourcetype.
|