NCHS's Vital Statistics Natality Birth Data -- 1968-2005

Natality Data from the National Vital Statistics System of the National Center for Health Statistics provide demographic and health data for births occurring during the calendar year. The microdata are based on information abstracted from birth certificates filed in the vital statistics offices of each State and the District of Columbia.

Other available birth data are Birth Cohort Linked Birth/Infant Death Data, 1983-1991 and 1995-1998; Period Linked Birth/Infant Death Data, 1995-2000, from the Perinatal Mortality Data; and Matched Multiple Birth Data, 1995-1997.

By using this data you signify your agreement with NCHS's data use rules. Works referring to the datasets or codebooks should contain a citation to NCHS. Published material derived from this data should include a citation such as this at the bottom of the table: "Source: National Center for Health Statistics (span of years used)"

Prior to 1972, data are based on a 50-percent sample of birth certificates from all States. Beginning in 1972, data are based on a 100-percent sample of birth certificates from some states and on a 50-percent sample from the remaining States. The number of States from which 100 percent of the records are used has increased from 6 in 1972 to all States and the District of Columbia in 1985. Birth data from the U.S. Territories Guam, Puerto Rico, and the U.S. Virgin Islands are available on a separate file beginning in 1994. In 1998, American Samoa and the Northern Marianas were added to the U.S. Territories files.
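The mixed sampling described above matters when counting births: a record drawn from a 50-percent sample stands for roughly two births, while a record from a 100-percent State stands for one. The following is a minimal sketch of that arithmetic only. The function name, the State labels, and the record counts are invented for illustration, and the files may carry their own weighting information, so check the documentation for the year in question before relying on this.

```python
def estimated_births(record_counts, full_count_states):
    """Scale per-State record counts to estimated birth totals.

    record_counts: dict mapping a State label to the number of records
        found in the file for that State (hypothetical input).
    full_count_states: set of State labels whose records are a
        100-percent sample; all other States are assumed to be a
        50-percent sample.
    """
    total = 0
    for state, n_records in record_counts.items():
        # 100-percent States: each record is one birth.
        # 50-percent States: each record stands for about two births.
        weight = 1 if state in full_count_states else 2
        total += n_records * weight
    return total


# Illustrative (made-up) counts: State "A" is fully counted, "B" is sampled.
example_total = estimated_births({"A": 100, "B": 50}, {"A"})
```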

Demographic data include variables such as date of birth, age and educational attainment of parents, marital status, live-birth order, race, sex, and geographic area. Health data include items such as birth weight, gestation, prenatal care, attendant at birth, and Apgar score. Geographic data include state, county, city (cities of 250,000+ up to 1980, and of 100,000+ from 1980 on), SMSA (from 1980 on), and metropolitan and nonmetropolitan counties.

Population files (such as natpop91.dat.Z) contain the population counts for U.S. women 15-44, those traditionally thought to be "at risk" for giving birth. Each file has 2448 lines, and each line gives the count for one combination of 51 state x 6 age x 4 race x 2 Hispanic-origin categories of the mother (51 x 6 x 4 x 2 = 2448). These files are available for 1991 on. Population files are not available for the U.S. Territories.
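As a quick sanity check on a downloaded population file, the expected line count can be reproduced from the category counts given above. This is a rough sketch: the integer indices are placeholders, not the actual codes used in the files, and nothing here assumes the order in which the lines appear.

```python
from itertools import product

# Category counts as stated in the text above.
N_STATES = 51
N_AGE_GROUPS = 6
N_RACE_GROUPS = 4
N_HISPANIC_GROUPS = 2


def population_cells():
    """Return every (state, age, race, hispanic) index combination."""
    return list(
        product(
            range(N_STATES),
            range(N_AGE_GROUPS),
            range(N_RACE_GROUPS),
            range(N_HISPANIC_GROUPS),
        )
    )


def has_expected_line_count(lines):
    """True if a list of lines has one entry per category combination."""
    return len(lines) == len(population_cells())
```

A file that was truncated during download or decompression would fail this check.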

SEER provides helpful U.S. Population data for 1969 on.

Both ".Z" and ".zip" files can be uncompressed with WinZip. In addition, ".Z" files can be uncompressed using the UNIX uncompress command, and ".zip" files can be unzipped with pkunzip.

To check your ability to uncompress these files, download the small files compress.Z or compress.zip. These files give an example of how to read .Z and .zip ASCII files into SAS for UNIX without decompressing the files. To download files in Internet Explorer, right click on them and select "Save Target As...". If the PDF documents appear to be all blank pages, get the latest Acrobat Reader at www.adobe.com.
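The same idea of reading a compressed ASCII file without first writing an uncompressed copy to disk can be sketched outside SAS. The example below uses only the Python standard library and handles ".zip" archives; ".Z" files are not covered, since the standard library has no reader for that format. The function name is illustrative, and the sketch assumes the archive member of interest is the first one unless a name is given.

```python
import io
import zipfile


def iter_zip_lines(zip_path, member=None, encoding="ascii"):
    """Yield text lines from one member of a .zip archive.

    The member is streamed, so the uncompressed file is never
    written to disk or held in memory all at once.
    """
    with zipfile.ZipFile(zip_path) as archive:
        # Assumption: if no member name is given, use the first entry.
        name = member if member is not None else archive.namelist()[0]
        with archive.open(name) as raw:
            text = io.TextIOWrapper(raw, encoding=encoding, errors="replace")
            for line in text:
                yield line.rstrip("\r\n")
```

For example, `sum(1 for _ in iter_zip_lines("natl1968.zip"))` would count the records in a downloaded file one line at a time.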

Variable layouts are essentially the same within each of the periods 1972-1977, 1979-1981, 1982-1983, 1984-1985, and 1992-1994, though a few codes change across years.

Thanks to Michael Greenstone and Kenneth Chay for the 1975-1985 data.

File size: The compressed 1968-1985 files are between 50 and 130 MB, and the compressed 1991-1994 and 1998-2002 files are 120-155 MB. The 2003 file is over 200 MB. The compression ratio for these files is over 90%.

Because of the large size of the complete collection, we would prefer that you not download large fractions over the web. NBER internal users can obtain the data from a UNIX shell at /homes/data/natality or on an NBER PC via Network Neighborhood --> NBER --> home --> data --> natality


United States -- Data & Documentation 1968-2005
Columns: Year, Birth Data (UNIX .Z), Birth Data (Pkzipped .zip), Birth Data (Stata .dta.zip), SAS Code, Stata Code (.do), Stata Code (.dct), SPSS Code, Documentation (.pdf)
1968 natl1968.Z natl1968.zip natl1968.dta.zip natl1968.sas natl1968.do natl1968.dct natl1968.sps natl1968.pdf
1969 natl1969.Z natl1969.zip natl1969.dta.zip natl1969.sas natl1969.do natl1969.dct natl1969.sps natl1969-1971.pdf
1970 natl1970.Z natl1970.zip natl1970.dta.zip natl1970.sas natl1970.do natl1970.dct natl1970.sps
1971 natl1971.Z natl1971.zip natl1971.dta.zip natl1971.sas natl1971.do natl1971.dct natl1971.sps
1972 natl1972.Z natl1972.zip natl1972.dta.zip natl1972.sas natl1972.do natl1972.dct natl1972.sps natl1972-1977.pdf
1973 natl1973.Z natl1973.zip natl1973.dta.zip natl1973.sas natl1973.do natl1973.dct natl1973.sps
1974 natl1974.Z natl1974.zip natl1974.dta.zip natl1974.sas natl1974.do natl1974.dct natl1974.sps
1975 natl1975.Z natl1975.zip natl1975.dta.zip natl1975.sas natl1975.do natl1975.dct natl1975.sps
1976 natl1976.Z natl1976.zip natl1976.dta.zip natl1976.sas natl1976.do natl1976.dct natl1976.sps
1977 natl1977.Z natl1977.zip natl1977.dta.zip natl1977.sas natl1977.do natl1977.dct natl1977.sps
1978 natl1978.Z natl1978.zip natl1978.dta.zip natl1978.sas natl1978.do natl1978.dct natl1978.sps natl1978.pdf
1979 natl1979.Z natl1979.zip natl1979.dta.zip natl1979.sas natl1979.do natl1979.dct natl1979.sps natl1979.pdf
1980 natl1980.Z natl1980.zip natl1980.dta.zip natl1980.sas natl1980.do natl1980.dct natl1980.sps natl1980.pdf
1981 natl1981.Z natl1981.zip natl1981.dta.zip natl1981.sas natl1981.do natl1981.dct natl1981.sps natl1981.pdf
1982 natl1982.Z natl1982.zip natl1982.dta.zip natl1982.sas natl1982.do natl1982.dct natl1982.sps natl1982.pdf
1983 natl1983.Z natl1983.zip natl1983.dta.zip natl1983.sas natl1983.do natl1983.dct natl1983.sps natl1983.pdf
1984 natl1984.Z natl1984.zip natl1984.dta.zip natl1984.sas natl1984.do natl1984.dct natl1984.sps natl1984.pdf
1985 natl1985.Z natl1985.zip natl1985.dta.zip natl1985.sas natl1985.do natl1985.dct natl1985.sps natl1985.pdf
1986 natl1986.Z natl1986.zip natl1986.dta.zip natl1986.sas natl1986.do natl1986.dct natl1986.sps natl1986.pdf
1987 natl1987.Z natl1987.zip natl1987.dta.zip natl1987.sas natl1987.do natl1987.dct natl1987.sps natl1987.pdf
1988 natl1988.Z natl1988.zip natl1988.dta.zip natl1988.sas natl1988.do natl1988.dct natl1988.sps natl1988.pdf
1989 natl1989.Z natl1989.zip natl1989.dta.zip natl1989.sas natl1989.do natl1989.dct natl1989.sps natl1989.pdf
1990 natl1990.Z natl1990.zip natl1990.dta.zip natl1990.sas natl1990.do natl1990.dct natl1990.sps natl1990.pdf
1991 natl1991.Z natl1991.zip natl1991.dta.zip natl1991.sas natl1991.do natl1991.dct natl1991.sps natl1991.pdf
1992 natl1992.Z natl1992.zip natl1992.dta.zip natl1992.sas natl1992.do natl1992.dct natl1992.sps natl1992.pdf
1993 natl1993.Z natl1993.zip natl1993.dta.zip natl1993.sas natl1993.do natl1993.dct natl1993.sps natl1993.pdf
1994 natl1994.Z natl1994.zip natl1994.dta.zip natl1994.sas natl1994.do natl1994.dct natl1994.sps natl1994.pdf
1995 natl1995.Z natl1995.zip natl1995.dta.zip natl1995.sas natl1995.do natl1995.dct natl1995.sps natl1995.pdf
1996 natl1996.Z natl1996.zip natl1996.dta.zip natl1996.sas natl1996.do natl1996.dct natl1996.sps natl1996.pdf
1997 natl1997.Z natl1997.zip natl1997.dta.zip natl1997.sas natl1997.do natl1997.dct natl1997.sps natl1997.pdf
1998 natl1998.Z natl1998.zip natl1998.dta.zip natl1998.sas natl1998.do natl1998.dct natl1998.sps natl1998.pdf
1999 natl1999.Z natl1999.zip natl1999.dta.zip natl1999.sas natl1999.do natl1999.dct natl1999.sps natl1999.pdf
2000 natl2000.Z natl2000.zip natl2000.dta.zip natl2000.sas natl2000.do natl2000.dct natl2000.sps natl2000.pdf
2001 natl2001.Z natl2001.zip natl2001.dta.zip natl2001.sas natl2001.do natl2001.dct natl2001.sps natl2001.pdf
2002 natl2002.Z natl2002.zip natl2002.dta.zip natl2002.sas natl2002.do natl2002.dct natl2002.sps natl2002.pdf
2003 natl2003.Z natl2003.zip natl2003.dta.zip natl2003.sas natl2003.do natl2003.dct natl2003.sps natl2003.pdf
2004 natl2004.Z natl2004.zip natl2004.dta.zip natl2004.sas natl2004.do natl2004.dct natl2004.sps natl2004.pdf
Public-use data from 2005 on do not include geographic detail, due to restrictions imposed by the states. The 2005-on data therefore contain no geographic variables such as state, county, or MSA. http://www.cdc.gov/nchs/VitalStats.htm has select tables, and http://www.cdc.gov/nchs/about/major/dvs/NCHS_DataRelease.htm has information on requesting restricted versions of the data that include geographic identifiers.
2005 natl2005.Z natl2005.zip natl2005.dta.zip natl2005.sas natl2005.do natl2005.dct natl2005.sps natl2005.pdf
Note: The 2003 data file is nearly four times larger than files from previous years because its records are 1297 characters wide, compared with 352 characters for the 2002 file. The uncompressed 2002 file is about 1.3 GB and the 2003 file is almost 5 GB. Compression software with a 2 GB limit will not work on the 2003 file; try other software such as WinRAR.

U.S. Territories Data, 1994-2004
Columns: Year, Births Data (UNIX .dat.Z), Births Data (Pkzipped .zip), SAS Code
1994 terr1994.dat.Z terr1994.zip terr1994.sas
1995 terr1995.dat.Z terr1995.zip terr1995.sas
1996 terr1996.dat.Z terr1996.zip terr1999.sas
1997 terr1997.dat.Z terr1997.zip
1998 terr1998.dat.Z terr1998.zip
1999 terr1999.dat.Z terr1999.zip
2000 terr2000.dat.Z terr2000.zip
2001 terr2001.dat.Z terr2001.zip
2002 terr2002.dat.Z terr2002.zip
2003 terr2003.dat.Z terr2003.zip natl03.sas
2004 terr2004.dat.Z terr2004.zip
Report on Final Natality Statistics FR1994 FR1995 FR1996 FR1997 FR1998 FR1999 FR2000 FR2001 FR2002
Standard Birth Certificates
sbc68-77   sbc78-88   sbc89-02   sbc03

To report errors, or if you have comments, suggestions, or an interest in SAS library files for the later data, e-mail jroth@nber.org

Last Update: April 8, 2008 Created by Jean Roth September 15, 2000

 

 