Drupal Module For Entity Tagging And Related Content

August 13th, 2011 by Oskar Olsson

We are bringing the semantic capabilities of the Saplo Text Analysis API to Drupal, one of the greatest and most widely used open source Content Management Systems (CMS).

The supported languages are Swedish and English, with additional languages coming in the future depending on interest.

The module works with Drupal 6.x and PHP 5 or later. If you’re interested in seeing a version for Drupal 7 please contact us. At this moment the module is being reviewed by the Drupal community, but you’re free to download it using Git from the sandbox page.

The Saplo Drupal module supports two of the main features from the Saplo API.

Drupal module for entity tagging and related content

Tagging

Using our module you can – with the push of a button – extract tags from articles.
The benefits of this versus a manual approach are twofold; first of all it’s very convenient to have the tags magically appear with the click of a button. Second of all you add consistency to the tagging, since the Saplo Text Analysis API is based on machine learning and will intelligently analyze and extract the tags the same way every time. It’s like having a dedicated employee just doing your tagging.

Related Articles

The module also provides you with a means to display related content, for example lists of “Related articles” that are often found at the end of news articles.
Similar modules have the capability to do this as well, but are often powered solely on tags. In other words, they make the assumption that articles that share the same tags should be related.
The Saplo API goes beyond that; we use semantic analysis and machine learning to give a higher precision to this relevance.

The caveat is that the module can only give you related content if that content has been previously analyzed. So as you analyze more and more of your articles or text, naturally more related content will be able to be displayed.
We basically establish relationships between different pieces of content, which is measured in a perceptual value. E.g. “this article is 60% similar to that article”. These relationships will be stored in the database powering your Drupal installation; so you’re free to use it in interesting ways if you want to.

Included in the module is also a basic filter, where you can change the relevance thresholds for displayed related content. So if you only want to display articles that are very related, you can easily do so in the administrative interface.

Using the Saplo Drupal Module

Installing the module is easy; you will be able to find the complete guide on the module page. In essence, it goes like this:

  1. Download and unpack the module inside your module folder.
  2. Enable the module in the Drupal administrative interface.
  3. Enter the API Key-pair that you’ve received from signing up at http://saplo.com/signup/free into the modules administrative interface.
  4. Enable the service you want (tagging, related articles) for your node types.
  5. Insert a new node, and when inside the view click the “Add Saplo Metadata”-button.

Drupal module settings for entity tagging and related content

The “Add Saplo Metadata”-button is probably the most common way you will be interacting with the module, besides changing settings inside the administrative interface.
Pressing this button sends the text from the node to the Saplo API. Once the Saplo API is done analysing the text it sends back the the extracted metadata that you’ve requested. The data is then saved locally, and displayed when the node is viewed.

Other features

The module implements various useful Drupal hooks. Amongst others we have different permissions that you can enable for users to access the administrative interface and to extract metadata for articles.
We also provide a small template that you can further configure that shows the related articles.