Brief Bio

My long-lasting research interest lies in the field of Natural Language Processing, Text Mining and Machine Learning. Since 2004 I create algorithms for problems related to Fact extraction, Named-entity recognition, Text classification and Web-search. During my previous career path I was working in such companies as Yandex and Microsoft, and had a successful startup in the petroleum industry.

At Exascale Infolab my goal was to create a solid expertise in modern Big Data infrastructures (mostly distributed indices) and apply this knowledge in IR-related projects.

The core topic of my PhD thesis is Dependency-Driven Analytics which is a new pattern in data analytics, where massive volumes of largely unstructured data are accessed through a compact and semantically-rich dependency graph.

Research Interests

Big Data; Machine Learning; Text Mining; Natural Language Processing; Information Retriveval;

Selected Papers

  1. Ruslan Mavlyutov, and Philippe Cudre-Mauroux. “Managing Big Interval Data with CINTIA the Checkpoint INTerval Array.” Transactions on Big Data (TBD), 2017. Bibtex PDF
  2. Ruslan Mavlyutov, Carlo Curino, Boris Asipov, and Philippe Cudré-Mauroux. “Dependency-Driven Analytics: A Compass for Uncharted Data Oceans.” In Proceedings of the 8th Biennial Conference on Innovative Data Systems Research (CIDR 2017). Santa Cruz, USA, 2017. Bibtex Slides PDF
  3. Julia Proskurnia, Ruslan Mavlyutov, Roman Prokofyev, Karl Aberer, and Philippe Cudré-Mauroux. “Analyzing Large-Scale Public Campaigns on Twitter.” In International Conference on Social Informatics, 225–43. Springer, 2016. Bibtex
  4. Sangeetha Abdu Jyothi, Carlo Curino, Ishai Menache, Shravan Matthur Narayanamurthy, Alexey Tumanov, Ruslan Mavlyutov, Jonathan Yaniv, et al. “Morpheus: towards Automated SLOs for Enterprise Clusters.” In Proceedings of OSDI’16: 12th USENIX Symposium on Operating Systems Design and Implementation, 117, 2016. Bibtex
  5. Alexander Butyaev, Ruslan Mavlyutov, Mathieu Blanchette, Philippe Cudré-Mauroux, and Jerome Waldispuhl. “A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data.” Nucleic Acids Research, 2015. Bibtex PDF
  6. Ruslan Mavlyutov, Marcin Wylot, and Philippe Cudré-Mauroux. “A Comparison of Data Structures to Manage URIs on the Web of Data.” In ESWC. Springer, 2015. Bibtex PDF
  7. Ruslan Mavlyutov, and Philippe Cudré-Mauroux. “CINTIA: A Distributed, Low-Latency Index for Big Interval Data.” In 2015 IEEE International Conference on Big Data, Big Data 2015, Santa Clara, CA, USA, October 29 - November 1, 2015, 619–28, 2015. Bibtex PDF
  8. Roman Prokofyev, Ruslan Mavlyutov, Martin Grund, Gianluca Demartini, and Philippe Cudré-Mauroux. “Correct Me If I’m Wrong: Fixing Grammatical Errors by Preposition Ranking.” In CIKM. CIKM ’14. Shanghai, China: ACM, 2014. Bibtex PDF
  9. Alexander Butyaev, Ruslan Mavlyutov, Mathieu Blanchette, Philippe Cudré-Mauroux, and Jerome Waldispuhl. “3DGB: A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data,” n.d. Bibtex

Source of inspiration