CPC G06N 3/084 (2013.01) [G06N 3/045 (2023.01); G06N 3/063 (2013.01); G06V 30/194 (2022.01)] | 15 Claims |
1. A device comprising at least one processor and at least one storage device storing instructions that, when executed by the at least one processor, cause the device to implement:
a neural network configured to perform an object detection task by processing data characterizing an input image to generate an alternative representation of the input image, the neural network comprising:
a plurality of subnetworks arranged in a sequence from lowest to highest, the plurality of subnetworks configured to process the data according to the sequence, the plurality of subnetworks comprising a plurality of module subnetworks, each of the module subnetworks comprising:
a plurality of groups of neural network layers configured to process a preceding output representation generated by a preceding subnetwork in the sequence and to generate a respective group output for each of the plurality of groups, wherein each group of the plurality of groups includes at least two successive convolutional layers comprising at least one 1×1 convolutional layer followed by one of (i) a 3×3 convolutional layer or (ii) a 5×5 convolutional layer; and
an output layer configured to process the alternative representation of the input image to generate an output for the object detection task from the input image.
|