- Technology Trends in the United States -

The data provided represents technology companies organized by year and year founded with location (city, state, zip), sales, employment, primary industry, and product types. The industry and product type classifications come from the North American Industry Classification System (NAICS). It is currently a subset of a database being used for research at UMass Lowell. In addition, a comma delimited data file (zipdata.csv) containing 5-digit zip codes and lat/longs is provided.

There are two types of files available - the CompanyDataXX files and the ProductDataXX files - in which the XX represents each year from 1989 to 2003.

Click here to access the data. The zipped version of all the available data is available for download.

Detailed NAICS information can be found at http://www.census.gov/epcd/www/naics.html.

The CompanyDataXX File

The CompanyDataXX file includes the following information for individual high tech companies for each year (XX) in the United States for the years 1989 to 2003 (15 years).

Column:

  1. Year
  2. Company ID (number)
  3. City
  4. State
  5. Zip
  6. Industry type (chemicals, energy, medical, software, etc.) - code description is in the file IndustryCodes
  7. Year formed (founded)
  8. Primary NAICS (government company classification code)
  9. Sales in Millions
  10. Employment Count

The ProductDataXX File

The files ProductDataXX represent the products classification per year for each company. These files may contain more records than Excel can open.

Column:

  1. Year
  2. Company ID (same as above)
  3. NAICS product code
  4. Product Verbal Description (providing more details on the product than the NAICS code)

There are 87,659 companies in the complete data set. About 60,000 companies are included for 2003 for example. There is one company data entry for each company for each year. There are multiple product data entries for each company for each year due to the fact that companies typically produce multiple products. The company data and production data can be related through the id and year combination. There is missing data.

The web site for the full NAICS codes is https://www.census.gov/cgi-bin/sssd/naics/naicsrch?chart=2002.
Companies in each NAICS code are all searchable from the data by region and year. Remember, government sources do not supply company specific information, only the totals and only for geographical areas where there are enough companies so that specific companies can not be identified. The data contains more. Each NAICS code can help identify companies for which the specific NAICS code is the company's primary code or primary industry and, incredibly, the same NAICS code identify companies that make a product that fits the NAICS code.

The zipdata File

Column

  1. Zip code
  2. Latitude
  3. Longitude

For additional questions, contact: MICHAEL_BEST[AT]UML.EDU or UCVEK[AT]LSUS.EDU