Post
2296
Dropped a dataset on here for linking org data: half a billion records scraped from LinkedIn networks. Positive/negative matches, bipartite graphs, Markov clusters – all the goods to train models that actually work on fuzzy company names.
NegMatches, PosMatches, holdouts for eval.
Check it out: cjerzak/LinkOrgs
NegMatches, PosMatches, holdouts for eval.
Check it out: cjerzak/LinkOrgs