The Company Repository (CoRepo) is an open search engine and data provider that tracks all companies with an online footprint. We crawl the open web and use machine learning to predict whether a website is a content site or a company homepage. We let users filter on metadata such as company age, funding status, and size. This metadata is extracted from the open web with NLP methods. One could describe CoRepo as a Google without the content websites.
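
As a rough illustration of the kind of filtering described above, here is a minimal Python sketch. It is not CoRepo's actual API: the `Company` record, its field names, the `filter_companies` helper, and the sample entries are all hypothetical and exist only to show how filters on age, funding status, and size might combine.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Company:
    # Hypothetical record shape; the field names are assumptions for illustration.
    name: str
    homepage: str
    age_years: int
    funding_status: str
    employees: int


def filter_companies(
    companies: Iterable[Company],
    max_age_years: Optional[int] = None,
    funding_status: Optional[str] = None,
    min_employees: Optional[int] = None,
) -> List[Company]:
    """Keep the companies that satisfy every filter that was supplied."""
    result = []
    for company in companies:
        if max_age_years is not None and company.age_years > max_age_years:
            continue
        if funding_status is not None and company.funding_status != funding_status:
            continue
        if min_employees is not None and company.employees < min_employees:
            continue
        result.append(company)
    return result


# Made-up sample data on reserved example domains.
SAMPLE = [
    Company("Acme Robotics", "https://acme.example", 3, "seed", 12),
    Company("Borealis Labs", "https://borealis.example", 8, "series_a", 85),
    Company("Cinder Foods", "https://cinder.example", 2, "seed", 4),
]

young_seed = filter_companies(SAMPLE, max_age_years=5, funding_status="seed")
print([c.name for c in young_seed])
```

Filters left as `None` are ignored, so the same helper covers a query on a single field as well as a combination of all three.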