Together with our partner we have started working on the new research project “RECAM”. This project is a cooperation project between deecoob Technology GmbH and Forensic Sience Investigation Lab at the University of Applied Sciences Mittweida. The abbreviation “RECAM” stands for retrospektive event monitoring for computer forensic enlightenment of copyright misuse on the basis of publicly available digital media.
The goal is to develop a technology for automated, computer-forensic investigation of use and misuse of copyrights at music events. The application uses web crawlers and full-text indexers and provides interfaces with Facebook, Instagram, Twitter, epaper, and Web pages. The purpose is to analyze, process, and qualify textual data from publicly available sources.
One basis for this is the development of a text mining method based on Elasticsearch search engine technology. An automated multi-vector space-based retrieval system should be able to recognize similar documents that contain content relevant to copyright. Variable cascading information filters increase retrival quality. This is where the LSI (Latent Semantic Indexing) method comes into play. A novel dynamically growing document clustering method is used for the first time in the field of text retrieval. It ensures that no processing and storage of personal data (to be protected by data law) takes place, since no personal metadata of a document is transferred to the system.
The project runs from September 2017 until August 2019. The research project is funded by the Central Innovation Program for SMEs of the Federal Ministry of Economics and Energy.