The "Names Game": Harnessing Inventors' Patent Data for Economic Research

Manuel Trajtenberg; Gil Shiff; Ran Melamed

doi:10.3386/w12479

Working Paper 12479

Issue Date September 2006

The goal of this paper is to lay out a methodology and corresponding computer algorithms that allow us to extract the detailed data on inventors contained in patents and harness it for economic research. Patent data has long been used in empirical research in economics, yet the information on the identity (i.e., the names and locations) of the patents' inventors has seldom been deployed on a large scale, primarily because of the "who is who" problem: the name of a given inventor may be spelled differently across her/his patents, and the exact same name may correspond to different inventors (the "John Smith" problem). Given that there are over 2 million patents with 2 inventors per patent on average, the "who is who" problem applies to over 4 million "records", which is far too many to tackle manually. We have thus developed an elaborate methodology and computerized procedure to address this problem in a comprehensive way. The end result is a list of 1.6 million unique inventors from all over the world, with detailed data on their patenting histories, their employers, co-inventors, etc. Forty percent of them have more than one patent, and 70,000 have more than 10 patents. We can trace these multiple-patent inventors across time and space, and thus study the causes and consequences of their mobility across countries, regions, and employers. Given the increasing availability of large computerized data sets on individuals, there may be plenty of opportunities to apply this methodology to other areas of economic research as well.
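
To make the two sides of the "who is who" problem concrete, the following is a minimal illustrative sketch, not the procedure developed in the paper. It assumes hypothetical records of the form (patent_id, name, city), merges spelling variants by normalizing case, accents, and punctuation, and separates same-name inventors by city. The function names, the record layout, and the use of city as the only distinguishing field are all assumptions made for illustration.

```python
import unicodedata
from collections import defaultdict


def normalize_name(name: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in no_accents)
    return " ".join(cleaned.lower().split())


def group_inventors(records):
    """Group patent IDs by (normalized name, normalized city).

    Assumption for illustration only: two records refer to the same
    inventor when both the normalized name and the normalized city agree.
    """
    groups = defaultdict(list)
    for patent_id, name, city in records:
        key = (normalize_name(name), normalize_name(city))
        groups[key].append(patent_id)
    return dict(groups)


# Hypothetical records, invented for this example.
records = [
    ("P1", "Smith, John", "Boston"),
    ("P2", "SMITH JOHN", "boston"),       # spelling variant of P1's inventor
    ("P3", "Smith, John", "Denver"),      # same name, treated as a different person
    ("P4", "Müller, Jürgen", "Munich"),
    ("P5", "Muller, Jurgen", "Munich"),   # accent variant of P4's inventor
]

inventors = group_inventors(records)
for key, patents in inventors.items():
    print(key, patents)
```

A rule this crude would wrongly split an inventor who moves between cities, which is exactly the mobility the paper aims to trace, so any serious procedure needs additional evidence beyond a single location field, such as the employer and co-inventor data mentioned above.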

This project has benefited enormously from the work of a group of extremely talented and dedicated research assistants, primarily Michael Katz, Alon Eizenberg, and Ran Eilat. Useful comments were provided by participants in numerous seminars, particularly at the NBER. We gratefully acknowledge the financial support of the National Science Foundation grant SES-0527657, the Israeli Science Foundation grant 1289/05, the Samuel Neaman Institute through its STE Program, and the Sapir Center.

Manuel Trajtenberg, Gil Shiff, and Ran Melamed, "The 'Names Game': Harnessing Inventors' Patent Data for Economic Research," NBER Working Paper 12479 (2006), https://doi.org/10.3386/w12479.
