Patent Examination Research Dataset (Public PAIR)

The original release of the Patent Examination Research Dataset (PatEx) contained detailed information on 9.2 million publicly viewable patent applications filed with the USPTO through December 2014. The first three updates to the dataset are also available; the most recent of these was posted in December 2018 and is referred to as the 2017 release. It covers all activity through 2017 and also includes activity through mid-July 2018.

The latest version of PatEx (referred to below as the 2019 release) contains detailed information on 11.3 million publicly viewable provisional and non-provisional patent applications to the USPTO and nearly 4.2 million Patent Cooperation Treaty (PCT) applications. It is based on data that OCE downloaded from the Patent Examination Data System (PEDS) on April 26, 2020. The PEDS data are sourced from Public PAIR. This is the first time that OCE has used PEDS as the basis of PatEx. We took the PEDS data and organized them into the familiar PatEx data files, which are based on the organization of the Public PAIR portal. The data files include information on each application’s characteristics, prosecution history, continuation history, claims of foreign priority, patent term adjustment history, publication history, and correspondence address information. There are, however, some minor differences between the new PatEx release and the previous ones. For this reason, we provide new technical documentation for the 2019 release (see New Technical Documentation below).

The OCE developed these data files for public use and encourages users to identify fixes and improvements. Please send all feedback to EconomicsData@uspto.gov.

Documentation

Original Documentation (For 2014 through 2017 Releases)

A document describing these data sets is available and can be cited as: Graham, Stuart J.H. and Marco, Alan C. and Miller, Richard, The USPTO Patent Examination Research Dataset: A Window on the Process of Patent Examination (November 30, 2015). Available at SSRN: https://ssrn.com/abstract=2702637.

Understanding how patent examination records become public is crucial to the proper analysis of the PatEx data. Thus, the document focuses primarily on the coverage of the underlying Public PAIR data and how it has evolved over time. It also includes several appendices that provide more detailed descriptions of the data elements in each of the files. These appendices can be accessed separately by clicking on the following links.

Appendix A: Description of the Application Data Tab Release

Appendix B: Description of the Transaction History Tab Release

Appendix C: Description of the Continuity Data Tab Release

Appendix D: Description of the Foreign Priority Tab Release

Appendix E: Description of the Patent Term Adjustment Tab Release

Appendix F: Description of the Address and Attorney/Agent Tab Release

Notes Regarding 2015 PatEx Data Files

New Technical Documentation (For 2019 Release)

If you are using the 2019 release, disregard the appendices above and refer instead to the new technical documentation: Miller, Richard D. Technical Documentation for the 2019 Patent Examination Research Dataset (PatEx) Release. USPTO Economic Working Paper No. 2020-4. Available here: https://www.uspto.gov/sites/default/files/documents/PatEx-2019-Technical-Doc.pdf.

An additional resource for the PatEx data is the paper "USPTO Patent Prosecution and Examiner Performance Appraisal," which can be cited as: Marco, Alan C. and Toole, Andrew A. and Miller, Richard and Frumkin, Jesse, USPTO Patent Prosecution and Examiner Performance Appraisal (June 1, 2017). USPTO Economic Working Paper No. 2017-08. Available at SSRN: https://ssrn.com/abstract=2995674 or http://dx.doi.org/10.2139/ssrn.2995674

Data Files

Each of the files below can be downloaded in either Stata-14 (DTA) or CSV format.

Download a full set of data files (2014): [.dta format (5.42 GB)] [.csv format (4.33 GB)]

Download a full set of data files (2015): [.dta format (5.56 GB)] [.csv format (4.99 GB)]

Download a full set of data files (2016): [.dta format (4.98 GB)] [.csv format (4.36 GB)]

Download a full set of data files (2017): [.dta format (5.37 GB)] [.csv format (4.8 GB)]

Download a full set of data files (2019): [.dta format (9.4 GB)] [.csv format (7.87 GB)]

Download individual data files (the direct download pages are here: 2014, 2015, 2016, 2017, 2019).

| File Name | 2014 DTA | 2014 CSV | 2016 DTA | 2016 CSV | 2017 DTA | 2017 CSV | 2019 DTA | 2019 CSV |
|---|---|---|---|---|---|---|---|---|
| application_data | 1.53 GB | 585 MB | 1.1 GB | 681 MB | 1.01 GB | 657 MB | 1.03 GB | 774 MB |
| all_inventors | 229 MB | 225 MB | 348 MB | 347 MB | 485 MB | 499 MB | 427 MB | 417 MB |
| transactions | 2.55 GB | 2.45 GB | 2.02 GB | 1.91 GB | 2.21 GB | 2.09 GB | 2.56 GB | 1.65 GB |
| event_codes | 75 KB | 21.2 KB | 36.4 KB | 22.8 KB | 37.8 KB | 23.5 KB | 40.7 KB | 23.3 KB |
| status_codes | 8.56 KB | 3.53 KB | 5.87 KB | 3.74 KB | 6.01 KB | 3.74 KB | No data | No data |
| continuity_parents | 49.9 MB | 48.7 MB | 73.2 MB | 58 MB | 79 MB | 63.1 MB | 102 MB | 80.2 MB |
| continuity_children | 40.9 MB | 40.9 MB | 47.9 MB | 47.7 MB | 51.9 MB | 51.6 MB | 63.6 MB | 61.3 MB |
| foreign_priority | 36.5 MB | 35.2 MB | 40.7 MB | 39.4 MB | 43.8 MB | 41.5 MB | 77 MB | 47 MB |
| pat_term_adj | 823 MB | 747 MB | 1.12 GB | 1.01 GB | 1.22 GB | 1.11 GB | 1.28 GB | 1.53 GB |
| pta_summary | 19.6 MB | 16.2 MB | 25.1 MB | 20.1 MB | 27.5 MB | 22 MB | 49.3 MB | 33.1 MB |
| pte_summary | No data | No data | No data | No data | No data | No data | 531 KB | 345 KB |
| correspondence_address | 165 MB | 243 MB | 236 MB | 280 MB | 276 MB | 299 MB | 350 MB | 362 MB |
| attorney_agent | No data | No data | No data | No data | No data | No data | 3.49 GB | 2.96 GB |
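Several of the CSV files are more than a gigabyte in size, so it can be easier to read them row by row than to load them whole. The following is a minimal sketch of that approach using only the Python standard library. The local file name `application_data.csv` and the helper names `list_columns` and `count_by_column` are assumptions made for illustration; they are not part of the PatEx release. The actual column names should be taken from the technical documentation for the release in use.

```python
import csv
from collections import Counter
from pathlib import Path


def list_columns(csv_path, encoding="utf-8"):
    """Return the header row of a CSV file without reading the data rows."""
    with open(csv_path, newline="", encoding=encoding) as handle:
        return next(csv.reader(handle), [])


def count_by_column(csv_path, column, encoding="utf-8"):
    """Stream a CSV file one row at a time and tally the values in one column."""
    counts = Counter()
    with open(csv_path, newline="", encoding=encoding) as handle:
        reader = csv.DictReader(handle)
        if reader.fieldnames is None or column not in reader.fieldnames:
            raise KeyError(f"column {column!r} not found in {reader.fieldnames}")
        for row in reader:
            counts[row[column]] += 1
    return counts


if __name__ == "__main__":
    # Assumed local path; adjust to wherever the downloaded CSV was saved.
    path = Path("application_data.csv")
    if path.exists():
        # Inspect the header first, then pick a column to tally.
        print(list_columns(path))
```

Inspecting the header first avoids guessing at column names, and streaming keeps memory use low because only the tallies are held in memory rather than the full file.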

 

Additional Resources

A good primer for the art of patent examination is the Manual of Patent Examining Procedure.