The data provided describes technology companies by year, with each company's year founded, location (city, state, zip), sales, employment, primary industry, and product types. The industry and product type classifications come from the North American Industry Classification System (NAICS). The data is currently a subset of a database being used for research at UMass Lowell. In addition, a comma-delimited data file (zipdata.csv) containing 5-digit zip codes and their latitudes and longitudes is provided.
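As a minimal sketch of how the zip code file might be used, the Python below builds a lookup from 5-digit zip code to latitude and longitude. The column names zip, lat, and lon are assumptions, since the layout of zipdata.csv is not described here, and the sample rows are made up for illustration. Zip codes are kept as zero-padded strings so that codes beginning with 0 (as many Massachusetts codes do) are not shortened to four digits.

```python
import csv
import io


def load_zip_lookup(handle):
    """Map 5-digit zip code strings to (lat, lon) tuples.

    The column names 'zip', 'lat', and 'lon' are assumed, not taken
    from the actual zipdata.csv header.
    """
    lookup = {}
    for row in csv.DictReader(handle):
        # Zero-pad so a code stored as 1854 is restored to 01854.
        code = row["zip"].strip().zfill(5)
        lookup[code] = (float(row["lat"]), float(row["lon"]))
    return lookup


# Illustrative rows only; real use would pass open("zipdata.csv", newline="").
sample = io.StringIO("zip,lat,lon\n1854,42.65,-71.35\n94043,37.41,-122.07\n")
zip_lookup = load_zip_lookup(sample)
```

Looking up a company's zip field in zip_lookup then gives a coordinate pair, provided the company zip is normalized the same way.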
There are two types of files available, the CompanyDataXX files and the ProductDataXX files, in which XX stands for a year from 1989 to 2003.
Each CompanyDataXX file contains information on individual high-tech companies in the United States for one year (XX), covering the years 1989 to 2003 (15 years).
The ProductDataXX files give the product classifications for each company in each year. These files may contain more records than Excel can open.
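The XX naming pattern can be sketched in Python as follows. This assumes XX is the last two digits of the year (89 through 03); the description does not state the file extension, so none is appended, and the function name data_file_names is hypothetical.

```python
def data_file_names(first_year=1989, last_year=2003):
    """Return (year, company_file, product_file) for each year in range.

    Assumes XX is the two-digit year; no file extension is added
    because the description does not give one.
    """
    names = []
    for year in range(first_year, last_year + 1):
        suffix = f"{year % 100:02d}"  # 1989 -> "89", 2003 -> "03"
        names.append((year, f"CompanyData{suffix}", f"ProductData{suffix}"))
    return names


files = data_file_names()
```

Because the product files may be too large for Excel, reading them one row at a time (for example with Python's csv module) is one way to avoid loading a whole file at once.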
There are 87,659 companies in the complete data set; about 60,000 of them are included for 2003, for example. There is one company data entry for each company for each year. There are multiple product data entries for each company for each year, because companies typically produce multiple products. The company data and product data can be related through the combination of id and year. Some data is missing.
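The id and year relationship can be illustrated with a short Python sketch that attaches each company's product entries to its company entry. Only id and year come from the description; the other field names (city, product) and all the sample rows are made up for illustration. Companies with no matching product entries get an empty list, which is one way to tolerate missing data.

```python
from collections import defaultdict


def group_products(product_rows):
    """Index product values by the (id, year) key shared with company rows."""
    by_key = defaultdict(list)
    for row in product_rows:
        by_key[(row["id"], row["year"])].append(row["product"])
    return by_key


def attach_products(company_rows, product_rows):
    """Return company rows with a 'products' list; empty if none matched."""
    by_key = group_products(product_rows)
    merged = []
    for row in company_rows:
        key = (row["id"], row["year"])
        merged.append({**row, "products": by_key.get(key, [])})
    return merged


# Made-up rows; field names other than id and year are assumptions.
companies = [
    {"id": "1", "year": "2003", "city": "Lowell"},
    {"id": "2", "year": "2003", "city": "Boston"},
]
products = [
    {"id": "1", "year": "2003", "product": "334111"},
    {"id": "1", "year": "2003", "product": "334112"},
]
merged = attach_products(companies, products)
```

Keying on the pair rather than on id alone matters because the same company id appears once in every year's company file.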