Annif - automated subject indexing using Finna as a corpus

Annif is a statistical automated indexing tool for libraries, archives and museums. After feeding it a SKOS vocabulary and existing openly available metadata from the Finna search engine for library, archive and museum collections, it knows how to assign subjects for new documents.

Annif has a REST API and a mobile web app that can analyze physical documents such as books. With Annif, we can add semantics to documents in three projects (Finnish, Swedish and English) using our own indexing vocabulary YSO.

Code for Annif is available on Github (CC0 license).

Watch the video

Try it!


Results