v1.9 1999-04-02

Specification of file names for Patent Data / SGML

Patent Data/SGML will be delivered on one DLT per week. For each patent, the tape will contain exactly one *.ZIP file, which in turn contains all the files for that one patent. Within the zip file, there will be exactly one *.SGM file and any number of associated files for complex work units (CWUs, which include chemical structures, mathematical formulae, tables, and gene sequence listings), for drawings, and for any characters which are rendered as bitmaps (so-called "pullouts").

In addition to the SGML markup of all text content and references to pullouts, the *.SGM file includes MathML markup for formulas (made SGML-compliant), CALS Table markup for tables, SGML markup for sequence listings, and references to each of the associated files. For further details about SGM file content, see the Red Book specification.

File names consist of the following components, as needed, in the order shown.

    aacccccccc-nnnnnnnn-annnnn-nnnn.aaa

    AA          Issuing country (US)
    CCCCCCCC    Patent number (8 characters or numbers)
    -           Dash
    NNNNNNNN    Issue date as YYYYMMDD (MM and DD left-padded)
    -           Dash
    A           Content type (D, C, M, T, S, or P)
    NNNNN       Left-padded sequence number
    -           Dash
    NNNN        Left-padded page number
    .           Period
    AAA         File format (ZIP, SGM, TIF, CDX, MOL, NB)

The sequence numbers represent the order in which the CWUs of a given type appear in the printed document. If a CWU is so large that it requires more than one page in the printed document, then the image of each printed page will be in a separate file and numbered as shown below.
Examples:

    US06000000-19990120.ZIP                The compressed file
    US06000000-19990120.SGM                SGML and other markup
    USD0367557-19990120-D00001.TIF         First drawing image
    US06000000-19990120-C00001.TIF         First chemistry image
    US06000000-19990120-C00001.CDX         CDX file for same
    US06000000-19990120-C00001.MOL         MOL file for same
    US06000000-19990120-M00001.TIF         First math image
    US06000000-19990120-M00001.NB          Mathematica file for same
    US06000000-19990120-T00001.TIF         First table image
    US06000000-19990120-T00002-0001.TIF    Second table image, first page
    USRE035111-19990120-T00002-0002.TIF    Second table image, second page
    US06000000-19990120-S00001.TIF         First sequence listing
    US06000000-19990120-S00002-0001.TIF    Second sequence listing, first page
    US06000000-19990120-S00002-0002.TIF    Second sequence listing, second page
    USPP023555-19990120-P00039.TIF         Thirty-ninth pullout image
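
The naming scheme described above can be checked mechanically. The following is a minimal Python sketch, not part of this specification. The names FILE_NAME_RE, PatentFileName, and parse_file_name are hypothetical and chosen for illustration only. It assumes that file names are upper case as in the examples, that padding is done with zeros, and that the content type, sequence number, and page number are omitted when not needed.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical pattern for the layout aacccccccc-nnnnnnnn-annnnn-nnnn.aaa.
# Assumption: upper-case names, as in the examples of this specification.
FILE_NAME_RE = re.compile(
    r"""
    (?P<country>[A-Z]{2})             # AA        issuing country
    (?P<patent>[A-Z0-9]{8})           # CCCCCCCC  patent number
    -(?P<date>\d{8})                  # NNNNNNNN  issue date, YYYYMMDD
    (?:
        -(?P<ctype>[DCMTSP])          # A         content type
        (?P<seq>\d{5})                # NNNNN     sequence number
        (?:-(?P<page>\d{4}))?         # NNNN      page number, if paged
    )?
    \.(?P<ext>ZIP|SGM|TIF|CDX|MOL|NB) # AAA       file format
    """,
    re.VERBOSE,
)


class PatentFileName(NamedTuple):
    country: str
    patent_number: str
    issue_date: str                # kept as the YYYYMMDD string
    content_type: Optional[str]    # None for the ZIP and SGM files
    sequence: Optional[int]
    page: Optional[int]
    file_format: str


def parse_file_name(name: str) -> PatentFileName:
    """Split a file name into its components, or raise ValueError."""
    m = FILE_NAME_RE.fullmatch(name)
    if m is None:
        raise ValueError(f"not a valid patent data file name: {name!r}")
    seq = m.group("seq")
    page = m.group("page")
    return PatentFileName(
        country=m.group("country"),
        patent_number=m.group("patent"),
        issue_date=m.group("date"),
        content_type=m.group("ctype"),
        sequence=int(seq) if seq is not None else None,
        page=int(page) if page is not None else None,
        file_format=m.group("ext"),
    )
```

For instance, parse_file_name("US06000000-19990120-T00002-0001.TIF") yields content type T, sequence number 2, and page number 1, while the ZIP and SGM names yield no content type, sequence number, or page number. The sketch checks only the shape of the name; it does not verify that the date is a real calendar date or that a given file format is permitted for a given content type.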