Logistics-wise, things have held up rather well. Once a query is processed for the first time, the result is stored for subsequent retrieval (though it is eventually reprocessed). The overwhelming majority of the queries over the last few weeks were already stored. My backend server handled the 3000 new queries mostly without congestion.
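To illustrate the store-then-reprocess idea, here is a minimal Python sketch. It is not Cinetrii's actual code: the `QueryCache` class, the `process` callback, and the `max_age_seconds` threshold are hypothetical names chosen for the example, and a real backend would presumably persist results in a database rather than in memory.

```python
import time
from typing import Callable, Dict, Tuple


class QueryCache:
    """Stores results per query and reprocesses them once they get too old."""

    def __init__(
        self,
        process: Callable[[str], str],          # hypothetical expensive lookup
        max_age_seconds: float,                 # assumed staleness threshold
        clock: Callable[[], float] = time.time,
    ) -> None:
        self._process = process
        self._max_age = max_age_seconds
        self._clock = clock
        self._store: Dict[str, Tuple[str, float]] = {}

    def get(self, query: str) -> str:
        now = self._clock()
        entry = self._store.get(query)
        if entry is not None:
            result, stored_at = entry
            if now - stored_at < self._max_age:
                return result                   # stored and still fresh
        # First time we see this query, or the stored result is stale.
        result = self._process(query)
        self._store[query] = (result, now)
        return result
```

With a setup like this, only the first request for a film (and the occasional refresh) pays the full processing cost, which matches the observation that most recent queries were served from storage.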
While Cinetrii can return really interesting results, it can also be disappointing at times, as I’m sure most users stumbling upon the site have experienced. I’m currently writing my master’s thesis, which takes up most of my time, but I’ve outlined some much-needed improvements that I will try to roll out during the year:
Ability for users to report erroneous connections.
More robust method of discriminating between multiple reviews on the same HTML page – possibly incorporating topic modeling.
A new scoring algorithm leaning further towards supervised machine learning.
Possibly alternative review sources to extend coverage of non-English-language films.