November 23, 2004

Company Identity

Over the weekend, my SEC filings harvester completed its second pass at downloading ownership-related filings. My hard-working script took nearly two weeks to complete the pull. This project is turning out to be a good lesson in aggregation and identification.

After examining the data extracted from these filings, I found gaps in the filings for certain companies caused by the reassignment of CIK codes. For example, Berkshire Hathaway Inc (CIK:1067983) has an earlier thread of filings under Berkshire Hathaway Inc /DE/ (CIK:109694). Thankfully, data about changes in company identity can be gathered from termination filings and from the "formerly known as" information in SEC EDGAR search results.

My third pass at downloading will take into account the changes in company identification found in my data sources, including changes of name, ticker symbol, and SEC CIK. I am now thinking about how to also publish data about the points where separate threads of identity are joined.
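One way the joining of identity threads might look in practice is sketched below. This is only an illustration, not the harvester's actual code: the `SUCCESSOR` table, the `canonical_cik` and `join_threads` helpers, and the sample filing dates and form types are all assumptions made up for the example. Only the two Berkshire Hathaway CIKs come from the filings themselves.

```python
from collections import defaultdict

# Hypothetical table linking an earlier CIK to the CIK that succeeded it.
# The single entry uses the Berkshire Hathaway pair mentioned above.
SUCCESSOR = {"109694": "1067983"}


def canonical_cik(cik, successor=SUCCESSOR):
    """Follow successor links until reaching a CIK with no known successor."""
    seen = set()
    while cik in successor and cik not in seen:  # the seen set guards against cycles
        seen.add(cik)
        cik = successor[cik]
    return cik


def join_threads(filings, successor=SUCCESSOR):
    """Group (cik, filed_date, form) records under their canonical CIK.

    Each joined thread is sorted by filing date, and every record keeps the
    CIK it was originally filed under so the join point stays visible.
    """
    threads = defaultdict(list)
    for cik, filed, form in filings:
        threads[canonical_cik(cik, successor)].append((filed, form, cik))
    return {key: sorted(records) for key, records in threads.items()}


# Made-up sample records, for illustration only (not real filings).
filings = [
    ("1067983", "2004-03-01", "4"),
    ("109694", "1997-05-15", "SC 13D"),
]
threads = join_threads(filings)
```

Keeping the original CIK on every record means the point where two threads meet can still be published later, rather than being lost in the merge.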