US 11,809,601 B2
System, method, computer program and data signal for identifying software capable of capturing personally identifiable information
Robin Jowett, Cremorne (AU)
Assigned to Source of True Pty Ltd, Waverly (AU)
Appl. No. 16/635,581
Filed by Lens10 Pty Ltd, Cremorne (AU)
PCT Filed Aug. 2, 2018, PCT No. PCT/AU2018/050808
§ 371(c)(1), (2) Date Jan. 31, 2020,
PCT Pub. No. WO2019/023756, PCT Pub. Date Feb. 7, 2019.
Claims priority of application No. 2017903065 (AU), filed on Aug. 2, 2017.
Prior Publication US 2020/0242272 A1, Jul. 30, 2020
Int. Cl. G06F 21/62 (2013.01); G06F 16/955 (2019.01); G06F 21/57 (2013.01)
CPC G06F 21/6263 (2013.01) [G06F 16/955 (2019.01); G06F 21/577 (2013.01); G06F 2221/032 (2013.01); G06F 2221/2115 (2013.01)] 19 Claims
OG exemplary drawing
 
1. A method for identifying software capable of capturing personally identifiable information sent from a local device to a remote server, comprising the steps of:
connecting to the remote server via the local device in order to send at least one request for data and retrieve at least one packet of data in response to the request, wherein the remote server is in communication with at least one third party server;
sending at least one request for data and retrieving, from the remote server, at least one packet of data in response to the request;
determining via an unpacking and decoding module that unpacks the at least one packet of data and decodes data residing in header fields, cookies, URL's, and request bodies in the at least one packet of data and detects third party domains residing in the header fields, cookies, URL's, and request bodies whether the at least one packet of data, received by the local device, originated from the at least one third party server other than the remote server, and
if so, determining, using machine learning, whether the at least one packet of data contains evidence of the presence of the personally identifiable information provided to the remote server by the local device; and
indicating, to a user, the presence of personally identifiable information by outputting a machine learning classification output.