If a column or line does not make sense for a given worksheet, check the
cstcodes.xls file for instructions.

PPS data is now a part of the Healthcare Cost Report Information System
(HCRIS) Data Set. PPS16 (1999) is the last PPS file.

The HCRIS data files are updated quarterly with more complete and
up-to-date records. I replace the old data files with the most recent
ones, so keep your own copies if you need an older release rather than
the most up-to-date files.

Downloading of HCRIS data has been discontinued due to the large file
size. The file is available for purchase ($100) at
http://www.cms.hhs.gov/data/cost_reports/default.asp

##To get the URL for the latest data, send an e-mail to PUFs@cms.hhs.gov with a subject of "Request URL"
##Or, skip downloading by ordering the CD-ROM for $100 at http://www.cms.hhs.gov/data/download/hcris_hospital/default.asp
##http://cms.hhs.gov/data/download/hcris_hospital_09_30_02.zip -- latest file as of 2002-01-02 (removed)

hcris_hospital_09_30_02.zip contains the following:

  counts_09_30_02.xls -- percent completion counts (same as what's
      available from the HCRIS website)
  2552-96_specifications.xls -- tells if a line item is alpha, numeric,
      or decimal.
  2552-96_worksheets.zip -- same as the worksheets available from the
      HCRIS website
  cstcodes.xls -- description of cost center codes
  hosp_dm.pdf -- Relational Data Model for Hospital HCRIS.
      I made a text version, hosp_dm.txt, using pdftotext.
      Has names for the 17 variables in hosp_rpt.csv, the five
      variables in hosp_rpt_alphnmrc.csv, and the five variables in
      hosp_rpt_nmrc.csv
  hosp_rpt.zip -- contains hosp_rpt.csv
  hosp_rpt_alphnmrc.zip -- contains hosp_rpt_alphnmrc.csv
  hosp_rpt_nmrc.zip -- contains hosp_rpt_nmrc.csv
      (62380285 records; takes about 40 minutes to load in SAS
      version 8)
  hosp_tables.sql -- contains the code you would need to create an SQL
      table of the hosp_rpt*csv data files.
  provider_control_type.xls -- value labels for the 13 provider control
      types.
      Also saved as provider_control_type.txt
  typeofhospital.xls -- value labels for the 9 hospital types.
      Saved as .txt file as well.
  datadictionary.xls -- meanings of data elements in the hosp_rpt*csv
      files. Saved as .txt files as well.

http://cms.hhs.gov/data/download/ has the following HCRIS documentation:

(1) a readme file, called readme_hcris.txt here and
    http://cms.hhs.gov/data/download/readmore_sept02_hcris.asp on the
    CMS website. It describes the files available in
    http://cms.hhs.gov/data/download/hcris_hospital_09_30_02.zip
(2) data dictionary, such as datadictionarycr2ndqtr2002.zip
    The data dictionary has the following sheets:
      (a) datadictionary.txt
      (b) state_codes.txt
      (c) facility_numbering.txt
(3) percent completion, such as counts_09_30_02.pdf
(4) worksheets, such as new255296Worksheets.zip

The worksheets are in the worksheets subdirectory because the sheets
have the same names as the sheets in the data files.

The new255296Worksheets.zip file unzipped to a number of MS Excel
worksheets:

  255296_a.xls
  255296_b.xls
  255296_c.xls
  255296_d.xls
  255296_e.xls
  255296_g.xls
  255296_h.xls
  255296_i.xls
  255296_j.xls
  255296_k.xls
  255296_l.xls
  255296_m.xls
  255296_s.xls

Each 255296_'letter'.xls file contained a number of sheets, each named
with the 'letter' followed by an Arabic number and/or a Roman numeral.
For example, 255296_a.xls contained the following:

  A, A6, A7I, A7III, A8, A81, A82, A83I, A83II, A83V, A-84

To make these files viewable with a text editor, I saved them as
tab-delimited text files such as worksheet_a.txt. The .xls files
preserve the original formatting.

For some reason, pkunzipping the .zip files creates files that are
world-writeable. Eek!

The latest date on the 2002-12-30 data is 2003-02-04.
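The world-writeable permissions noted above can be tightened after
extraction. The following is a minimal Python sketch, not the procedure
actually used for these files: it walks an already-unzipped directory
and clears the "other" write bit. The function names clear_world_write
and extract_safely are hypothetical, and the sketch assumes a
POSIX-style file system.

```python
import os
import stat
import zipfile


def clear_world_write(root):
    """Remove the 'other' write bit from every file and directory under root.

    Returns the list of paths whose mode was changed.
    """
    changed = []
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue  # leave symlinks alone
            mode = stat.S_IMODE(os.lstat(path).st_mode)
            if mode & stat.S_IWOTH:
                os.chmod(path, mode & ~stat.S_IWOTH)
                changed.append(path)
    return changed


def extract_safely(zip_path, dest_dir):
    """Extract zip_path into dest_dir, then strip world-write permission."""
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
    return clear_world_write(dest_dir)
```

Running clear_world_write on a directory that pkunzip has already
populated is enough; extract_safely is only a convenience for doing
both steps from Python.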
The following files stayed the same from releases 2002-09-30 to
2002-12-31:

  Length  Method     Size  Ratio    Date    Time    CRC-32   Name
  ------  ------     ----  -----    ----    ----    ------   ----
    1308  DeflatN     291   78%  05-15-02  08:57  c1b99eab  HOSP_TABLES.sql
   64000  DeflatN   15997   76%  11-01-02  12:24  245d5447  CSTCODES.XLS
   32768  DeflatN    7887   76%  11-07-02  09:06  c33d875b  Data Dictionary.xls
  525802  Stored   525802    0%  10-22-02  14:27  83da75d5  2552-96_ Worksheets.zip
    4937  DeflatN    3094   38%  05-22-02  08:47  9243cf5a  HOSP_DM.pdf
   13824  DeflatN    1599   89%  05-14-02  14:38  63b64b49  Provider_Control_Type.xls
   14336  DeflatN    1716   89%  11-01-02  13:06  506cac65  TypeofHospital.xls
   96256  DeflatN   16695   83%  04-11-02  16:15  973ca54f  Worksheet_Codes.doc
  ------           ------   ---                             -------
16849257         16341725    1%                             14

To get the URL for the latest data, send an e-mail to PUFs@cms.hhs.gov
with a subject of "Request URL"

hcris@cms.hhs.gov -- for questions

wk_xwalk.sas7bdat is a crosswalk between the worksheet codes from the
database, WKSHT_CD, and the worksheet name and label (if appropriate),
such as the following:

  A83P001    A-8-3, Part I    Physical Therapy (h)

by Jean Roth, jroth@nber.org, 2003-01-10

http://www.cms.hhs.gov/CostReports/02_HospitalCostReport.asp#TopOfPage
has hospital name and address file
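
The WKSHT_CD crosswalk described above can serve as a simple lookup
table when reading the data files. The following is a rough Python
sketch under several assumptions: the crosswalk has been exported from
wk_xwalk.sas7bdat to CSV, the column names WKSHT_NAME and WKSHT_LABEL
are hypothetical (only WKSHT_CD is named in these notes), and the sample
row splits the example entry into a name and a label as I read it.

```python
import csv
import io

# Hypothetical CSV export of wk_xwalk.sas7bdat. Only WKSHT_CD is a
# documented name; the other two column names are assumptions.
XWALK_CSV = """WKSHT_CD,WKSHT_NAME,WKSHT_LABEL
A83P001,"A-8-3, Part I",Physical Therapy (h)
"""


def load_crosswalk(handle):
    """Return {WKSHT_CD: (name, label)} from a CSV file-like object."""
    return {
        row["WKSHT_CD"]: (row["WKSHT_NAME"], row["WKSHT_LABEL"])
        for row in csv.DictReader(handle)
    }


def describe(wksht_cd, crosswalk):
    """Return a readable worksheet description, or the raw code if unknown."""
    if wksht_cd not in crosswalk:
        return wksht_cd
    name, label = crosswalk[wksht_cd]
    return f"{name} {label}".strip()


crosswalk = load_crosswalk(io.StringIO(XWALK_CSV))
print(describe("A83P001", crosswalk))
```

Codes missing from the crosswalk are returned unchanged, so the lookup
can be applied to every record without first checking coverage.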