uspto.gov
Skip over navigation

Patent Application Publication Data Products

A patent application is a document submitted by an inventor requesting a patent be issued.

A patent application document contains bibliographic front page information, an abstract (summary), specification and claims as originally filed, and drawings depicting the invention.

Patent Application Publication Multi-Page Images (2001-current Calendar Year)

Contains the images of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) from March 15, 2001 to present in Tagged Image File Format (TIFF) Revision 6.0 with CCITT Group 4 Compression (multi-page TIFFs) from the USPTO USAApp optical disc product (discontinued 12/31/2011).

Each weekly file contains approx. 5,000 patent application publications. Approx. 6 GB (compressed) per week. Entire collection approx. 4 TB.

Available weekly (7-14 days after publication) for no charge:  http://patents.reedtech.com/pampi.php

Back to top

Patent Application Publication Single-Page Images (current Calendar Year)

Contains the images of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) from March 15,2001 to present in Tagged Image File Format (TIFF) Revision 6.0 with CCITT Group 4 Compression (single-page TIFFs).

Each weekly file contains approximately 5,000 published patent applications. Approx. 8 GB (compressed) per week. Backfiles are approx. 10 GB (compressed). Entire collection approx. 4 TB. 

Available weekly for no charge:  http://patents.reedtech.com/payb.php   

Documentation:  http://www.uspto.gov/products/xml-resources.jsp

 (*An annual subscription for the current Calendar Year is available on blu-ray discs for $5,200. Contact ipd@uspto.gov for ordering information.)

Back to top

Patent Application Publication Full-Text (2001-current Calendar Year)

Contains the full text of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) in CY2013 (excludes images/drawings). The file format is eXtensible Markup Language (XML) in accordance with the Patent Application Version 4.3 International Common Element (ICE) Document Type Definition (DTD). These files are a subset and concatenation of the Patent Application Publication Data/XML Version 4.3 ICE. Because of the concatenation of the individual XML documents, these files will not parse successfully or open/display by default in Internet Explorer. They also will not import into MS Excel. Each XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 5,000 plus start/end tag combinations. If you take one document out of the Patent Application Publication Full Text file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will parse/open the document successfully. NOTE:  You may receive a warning about Active X controls. NOTE:  All Patent Application Publication Full Text files will open successfully in MS Word; NotePad; WordPad; and TextPad.

These product files (52 zip files totaling TBD GB - compressed) are available for no charge from:  http://patents.reedtech.com/parbft.php   

Documentation:  http://www.uspto.gov/products/xml-resources.jsp

 (*An annual subscription for the current Calendar Year is available for $2,500. Contact ipd@uspto.gov for ordering information.)

Back to top

Patent Application Publication Full-Text with Embedded Images (2001-current Calendar Year)

Contains the full text, images/drawings, and complex work units (tables, mathematical expressions, chemical structures, and genetic sequence data) of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) in CY2013. The file format is eXtensible Markup Language (XML) in accordance with the Patent Application Version 4.3 International Common Element (ICE) Document Type Definition (DTD). Tables and sequence data are included using CALS markup. Mathematical expressions are included using MATHML markup and external Mathematica Notebook (NB) files. Chemical structures are represented by external CambridgeSoft Corp. ChemDraw (CDX) files and MDL Information Systems (MOL) files. Drawings, mathematical expressions, and chemical structures are also included as external Tagged Image File Format (TIFF) Revision 6.0 with CCITT Group 4 Compression image files. Each weekly file contains approx. 5,000 patent application publications. There can be an optional weekly Supplemental zip file that contains lengthy sequence listings (anything over 300 pages) or lengthy tables (anything over 200 pages).

Available for no charge:  http://patents.reedtech.com/parbfti.php

Documentation:  http://www.uspto.gov/products/xml-resources.jsp

 (*An annual subscription for the current Calendar Year is available on DVD-ROMs for $5,200. Contact ipd@uspto.gov for ordering information.)

Back to top

Patent Application Publication Bibliographic (2001-current Calendar Year)

Contains the bibliographic text (i.e., front page) of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) in CY2013 (excludes images/drawings). The file format is eXtensible Markup Language (XML) in accordance with the Patent Application Version 4.3 International Common Element (ICE) Document Type Definition (DTD). These files are a subset and concatenation of the Patent Application Publication Data/XML Version 4.3 ICE (Text Only). Because of the concatenation of the individual XML documents, these files will not parse successfully or open/display by default in Internet Explorer. They also will not import into MS Excel. Each XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 5,000 plus start/end tag combinations. If you take one document out of the Patent Application Publication Bibliographic file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will parse/open the document successfully. NOTE:  You may receive a warning about Active X controls. NOTE:  All Patent Application Publication Bibliographic files will open successfully in MS Word; NotePad; WordPad; and TextPad. Available on publication day (Thursdays).

Approx. 2.7 MB per week (compressed).

These product files are available for no charge from:  http://patents.reedtech.com/parbbib.php   

This product includes an ipabyyyymmdd_wknn.zip file for each week [where "yyyymmdd" is a Thursday publication date and "nn" is a two-digit, fixed-length number (with leading zero) representing the sequentially-numbered week of the year]. Within each weekly zip file are (3) files:  ipabyyyymmdd.xml (Bibliographic information in XML ICE); ipabyyyymmddlst.txt (List of published patent application numbers in ascending order); and ipabyyyymmddrpt.html (Statistical/summary report).

Documentation:  http://www.uspto.gov/products/xml-resources.jsp

Back to top

United States Patent and Trademark Office
This page is owned by Public Information Services Group.
Last Modified: 10/30/2013 1:05:33 PM