Business Optimist • Technology Realist • Old School Values

Company Data Web Harvester

Increase firmographic richness (completion) using web crawls, machine-assisted modules, and human-in-the-loop pipelines, bringing the top 1M companies to 90% coverage across all key attributes (2018): addresses, industry classifications, revenue, employee counts, and URLs. Led to industry-leading company data quality at 90% accuracy.
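As a rough illustration of what "coverage" means here, the sketch below computes the share of company records with a non-empty value for each key attribute and flags attributes under a 90% target. The field names (`address`, `industry`, `revenue`, `employee_count`, `url`) and the helper functions are hypothetical, chosen for illustration only; they are not the production schema or pipeline.

```python
from typing import Any, Iterable, Mapping

# Hypothetical field names standing in for the key attributes listed above.
KEY_ATTRIBUTES = ("address", "industry", "revenue", "employee_count", "url")


def is_filled(value: Any) -> bool:
    """Treat None and blank strings as missing; anything else counts as filled."""
    if value is None:
        return False
    if isinstance(value, str):
        return bool(value.strip())
    return True


def attribute_coverage(
    records: Iterable[Mapping[str, Any]],
    attributes: Iterable[str] = KEY_ATTRIBUTES,
) -> dict[str, float]:
    """Return, per attribute, the fraction of records with a filled value."""
    rows = list(records)
    attrs = list(attributes)
    if not rows:
        return {attr: 0.0 for attr in attrs}
    return {
        attr: sum(is_filled(row.get(attr)) for row in rows) / len(rows)
        for attr in attrs
    }


def below_target(coverage: Mapping[str, float], target: float = 0.90) -> list[str]:
    """List the attributes whose coverage falls short of the target, sorted by name."""
    return sorted(attr for attr, share in coverage.items() if share < target)
```

Attributes returned by `below_target` would be the natural candidates for additional crawling or human review, under the assumption that coverage is measured per attribute rather than per record.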

Examples of the breadth and depth of primary sources include:

  • US Department of Labor via Form 5500/SF (Annual Return/Report of Employee Benefit Plan) for the number of participants
  • The IRS via Form 990 for tax-exempt organizations, nonexempt charitable trusts, and section 527 political organizations
  • National Center for Education Statistics (NCES) data files for US school districts
  • US government: city, town, and county websites, and chambers of commerce
  • US higher education institutions and colleges (enrollment and staff)
  • US Hospitals and nursing homes via National Center for Health Statistics, CDC, and American Hospital Association
  • The Investment Adviser Public Disclosure (IAPD) database for registration documents for state-registered investment advisers
  • E-Verify for employers with workforce sizes provided by Department of Homeland Security and USCIS
  • B2B social media platforms (e.g., LinkedIn, Facebook)
  • Corporate domain annual reports, press releases, and blogs
  • Open-data business registries for Norway, France, Germany, Denmark, Australia, the UK, the Netherlands, Singapore, New Zealand, and India
