I had the opportunity to meet Dr. Claude de Loupy in Paris last month at the LeWeb conference. Dr. de Loupy heads the French startup Syllabs, which develops semantic analysis tools for data mining, text extraction and various other applications, which I will get to in a bit.

The Semantic Web is a set of methods and technologies that allow machines to understand the meaning – or “semantics” – of information on the Internet. You can read more about it here. This goes well beyond language recognition: it is the ability to understand the meaning of content. One of the basic applications of the Semantic Web is sentiment analysis, also known as opinion mining. Another application is text generation: Semantic Web software can automatically generate human-quality text from a database.

You can read more about the tools Syllabs is developing in the Syllabs API documentation. A brief summary of these tools:

  • Language detection: this is a pretty standard feature which Syllabs offers
  • Text extraction: mine pure content from the Internet while stripping pages of ads, menus, footers
  • Named entity extraction: semantic tagging of content to identify named entities such as locations, names of persons, names of organizations
  • Related keywords: find keywords for SEO based on linguistic analysis
  • Sentiment Analysis
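
To make the text extraction item more concrete, here is a minimal Python sketch of the general idea: keep the readable text of a page and skip anything inside tags that usually hold menus, footers or scripts. This is not Syllabs' method or API. The `SKIP_TAGS` set and the `extract_text` function are names made up for this illustration, and a real tool would rely on much more than tag names.

```python
from html.parser import HTMLParser

# Assumption for this sketch: these tags usually wrap boilerplate, not content.
SKIP_TAGS = {"script", "style", "nav", "header", "footer", "aside"}


class ContentExtractor(HTMLParser):
    """Collects text that is not nested inside any of the SKIP_TAGS."""

    def __init__(self):
        super().__init__()
        self.skip_depth = 0  # how many skipped tags we are currently inside
        self.chunks = []     # pieces of text that were kept

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self.skip_depth == 0:
            self.chunks.append(text)


def extract_text(html):
    """Return the page text with menu, footer and script content removed."""
    parser = ContentExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

Calling `extract_text` on a page whose body holds a `<nav>` menu, a paragraph and a `<footer>` returns only the paragraph text.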

Syllabs is part of the TTC initiative, which is funded by the EU. In addition to this funding, Syllabs has a number of commercial products offered in a SaaS model.

Dr. de Loupy was kind enough to provide an online demo of two of the API features.

Related Keywords

This feature can be used for SEO. It is similar to Google’s keyword tool but offers more linguistically accurate results based on advanced algorithms, according to Dr. de Loupy. Since you can enter a number of keywords at the same time, the Syllabs tool returns keywords that fit the context of the other keywords you entered. The tool also provides data that lets you rank the importance of the keywords it mines, for more effective SEO of website content.

Semantic tagging

This feature tags content by identifying names of people, organizations and events, and their relationships to each other.
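
As a rough illustration of what tagging named entities means, here is a toy Python sketch that looks up names from a small hand-written list (a "gazetteer") and labels each match. It is not how Syllabs does it: the `GAZETTEER` entries, the labels and the `tag_entities` function are assumptions made up for this example, and it does not attempt the relationship detection described above.

```python
import re

# Hypothetical gazetteer for this sketch: known names and the label to assign.
GAZETTEER = {
    "Claude de Loupy": "PERSON",
    "Syllabs": "ORGANIZATION",
    "Paris": "LOCATION",
    "LeWeb": "EVENT",
}


def tag_entities(text):
    """Return (position, name, label) tuples for every gazetteer match."""
    found = []
    for name, label in GAZETTEER.items():
        # Word boundaries avoid matching a name inside a longer word.
        pattern = r"\b" + re.escape(name) + r"\b"
        for match in re.finditer(pattern, text):
            found.append((match.start(), name, label))
    return sorted(found)  # ordered by position in the text


if __name__ == "__main__":
    sentence = "Claude de Loupy of Syllabs spoke at LeWeb in Paris."
    for position, name, label in tag_entities(sentence):
        print(position, name, label)
```

A lookup table like this only finds names it already knows, which is why real semantic tagging relies on linguistic analysis instead.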
