The kimono team is joining Palantir!

After almost 2 years of building and growing kimono, we couldn’t be happier to announce that the kimono team is joining Palantir.


As a team, we’re very proud of the product we built, now used by over 125k developers, data scientists and businesses. When we started two years ago, we had a clear mission to help people everywhere structure and extract value from data. Since then, we’ve realized that continuing to work in isolation on a general data collection tool simply won’t allow us to make the impact we want. When we met the team at Palantir, we were instantly excited by the potential — the incredible talent and access to the world’s most important data problems – even if it meant no longer working on the kimono product. Our decision to join gives us unmatched support, resources and the ability to work on things we could not tackle alone as a small startup.


What’s happening to kimono?

Because of our new roles at Palantir, it will not be possible for us to continue providing the publicly available cloud hosted kimono product. Effective 2/29/2016 we will be shutting down the service. From that point forward users will no longer be able to log into services or access any data via the website or API endpoints. Although the kimono service will not be following us to Palantir, we know how many of you depend on kimono to run your businesses/projects, so we’ve decided to provide users with a light-weight desktop version of kimono to ease the transition.

Kimono for Desktop will offer a similar experience and much of the same utility as the product at, but will no longer be a cloud hosted solution. The software, available for Mac OS X and Windows, will integrate with a new version of the chrome extension and perform all data collection locally on your machine. The software download will be free of charge and will function entirely independently from the kimono platform — and will be provided as is, without continued active development and without technical or customer support. Check out the FAQ.


What about my data?

No data will be accessible via after 2/29/2016. As part of the kimono desktop product, you will be given a 30 day window to import your APIs (ending 03/31/2016). This function will transfer the APIs themselves including API rules, configuration details like modify results functions and data collection rules/url lists but will not transfer any data. All personal information including account information/credentials, Auth API credentials and Auth API data, will be securely deleted from our servers at the end of this transfer window.

For more information, be sure to check out the FAQ.

Thank you to each of you for supporting kimono over the past two years.

Building knowledge graphs for new technologies with kimono


mergeflow provides deep insights into the next generation of technology by creating a unique knowledge graph. Large global tech corporations and investors use this graph to search for specific technologies and discover the products, research, patents, start-ups, and experts associated with it.

mergeflow builds this graph with information from a broad cross-section of sources that have been fed through its proprietary natural language and machine learning algorithms to determine trends and associations. Using Mergeflow’s knowledge base, global tech corporations can innovate future products and technology, and investors can lead the pack in finding and investing in new tech.

Screen Shot 2015-04-23 at 11.50.23 AM

mergeflow collects this information for its algorithms by creating streamlined and standardized kimono APIs off of hundreds of sites, from university research centers, patent offices, regulatory bodies, press outlets, tech publications, research projects, databases and blogs etc. Using kimono, Mergeflow can collect information from a much broader set of sources and focus their efforts on the natural language and machine learning algorithms that provide insight and the graph that maps the next generation of technology.

Read more about mergeflow here. mergeflow wrote a great tutorial blog about how they used kimono to create an API off of “Bayerische Patentallianz”, a Bavarian government organization that hosts interesting technology offers that come out of Bavarian research institutions.

As an example of the kinds of insights you can get when you combine kimono and mergeflow, we looked at the following two questions:

1. Investor-university networks: how does the investor network associated with Technion compare to the investor network associated with the Technical University Munich?

2. Technology offerings: which are prominent research organizations with technology offerings in the area of natural language processing? And at these organizations, who are the relevant people?

Continue reading

How Contactive builds complete user profiles with kimono


Contactive (acquired in January 2015 by ThinkingPhones) provides an app for enhanced Caller ID, allowing individuals and businesses to view a robust user profile associated with each incoming phone number. Contactive’s value to its users depends on the completeness of their data set – i.e., how many incoming phone numbers they can identify with profile information. To make this possible, Contactive has created an Identity Graph, where they map phone numbers and emails to profiles from many different sources (e.g., Google Places, Twitter, Facebook, LinkedIn, Yellow Pages, Yelp).

Julio Viera, Contactive’s VP of Engineering, says that out of the 400k calls that pass through Contactive every day, 50% are correctly identified to at least one profile from one source.

For the remaining 50%, the problem is not that the contact information associated with each phone number doesn’t exist, but that it is spread across thousands of disparate websites – from local directories, to automotive dealership guides to real estate agent listings. This is where Julio turns to Kimono to fill in the missing data, using it to structure these disparate information sources, feeding into Contactive’s Identity Graph structured data sets from all corners of the web.

To date, Contactive has used Kimono to match ~40k (10%) numbers per day, transforming scattered information into a powerful business asset.

Transforming one web page into $1 bn of market-moving insights

We all know the web contains valuable information, but the untapped potential of information that is out there, but not structured or used correctly, is enormous.

Any doubts about that statement were put to rest recently when a web-data firm, Selerity, uncovered Twitter’s Q1 earnings results about an hour before the scheduled release. Twitter’s stock promptly fell 5 percent, losing roughly $1.6 billion of market capitalization. The stock continued to tumble throughout the day, as the earning results were lower than expected, and investors who knew and acted on the information first profited enormously.

A day later, a similar story unfolded with When news reports surfaced that had hired bankers to help review bids from potential acquirers, Salesforce shares jumped 11 percent, adding more than $5.3 billion to Salesforce’s total market value.

The stocks in these stories moved in different directions, but the takeaway is the same – early insights into company health and financial data can be enormously valuable.

Traders, portfolio managers, analysts and others in the the finance industry spend huge sums each year for access to potentially market-moving data. A Bloomberg terminal, considered to be the industry’s gold standard for news and financial information, costs more than $20,000 a year.

Finance professionals can justify such expenses because the sums they trade are so large. But individual investors or developers working on a custom trading algorithm typically lack those resources.

Now, a new class of tools is emerging to help level the information playing field, by transforming dispersed and nebulous information into structured data and powerful insights.

Kimono is one of these tools that lets anyone — from retail investors to seasoned financial professionals — gather and process information from anywhere on the web. Kimono’s ability to structure information allows anyone to capitalize on the potential value offered by internet data.

In the Twitter example, Twitter’s investor relations website posted the company’s earnings announcements to a URL that followed the same pattern as all other announcements. The firm was counting on the fact that nobody was looking at that URL, but with the ability to easily monitor URLs of interest, a web crawler like kimono can easily detect that change in base data.

While laws and regulations are in place to prevent insider knowledge from swaying the markets, there is nothing that prevents using public information smarter and faster than your competitors.

Generate high quality potential candidate leads

Good hiring is one of (if not the most) important drivers of success for any company, as great people will find creative solutions to hard problems, form a strong culture and propel a company forward. Great talent is scarce in any environment and to find the best talent you need to start with a wide and well-qualified funnel of potential candidates. This is where front-end sourcers like Aaron Lintz play a critical role. They find potential candidates from many well-known and obscure sources and research them to effectively qualify their leads based on factors including technical skill and company fit.

Aaron’s biggest challenge is honing in on the right leads, and this is where Aaron uses kimono to gain an edge. Aaron uses kimono to source potential candidates for a myriad of job functions in the US, Europe and the Middle East and Africa. In this example, we’ll focus on how Aaron finds software engineers.

Continue reading

Unlocking SEC data with kimono’s SEC API

Financial filings for public companies contain critical information for investors, analysts and many other financial professionals. However, easy and free programmatic access of SEC data remains hard to get. With the first earnings season for 2015 upon us, we wanted to make a few of your lives a little bit easier…

Our new SEC API* provides easy, programmatic access to data from ~19,000 public companies’ SEC filings. The dataset includes metrics from consolidated balance sheets, income statements, cash flow statements plus valuation, profitability, leverage and liquidity ratios. Get the latest data as filings are announced and access 5 years of historic data as well.

Screen Shot 2015-04-26 at 10.56.55 PM

With our new API you can:

    Query financials for 19k publicly traded companies over the past five years
    Set up other kimono APIs to see how company financials correlate with any other data scraped from the web
    Download the data and analyze it in excel
    Copy our code snippets and call the API directly from your own apps

We are excited to see what you do with our API and what creations are #builtwithkimono! Share it with us on twitter or via email so we can include it in our gallery.

*This API is released in partnership with EDGAR Online, an RR Donnelly company

Introducing crawl history & error reports

We’ve just rolled out crawl histories and and error reporting for all your APIs. Now you can see the full story of all your kimono API crawls on a URL level. We built this feature so that you can quickly assess and debug your crawls, rapidly identify problematic URLs, and build the most robust data set possible.

Crawl history and error reports answer:

  • When did this crawl run?
  • Which URLs were succesfully crawled?
  • Do the properties you are extracting exist on each URL?
  • Why did some URLs fail?

This should make working with all your kimono APIs easier and get you the best data possible quickly. Here’s a sample view:

Screen Shot 2015-04-21 at 5.24.47 PM