Brief Bio
My long-lasting research interest lies in the field of Natural Language Processing, Text Mining and Machine Learning. Since 2004 I create algorithms for problems related to Fact extraction, Named-entity recognition, Text classification and Web-search. During my previous career path I was working in such companies as Yandex and Microsoft, and had a successful startup in the petroleum industry.
At Exascale Infolab my goal was to create a solid expertise in modern Big Data infrastructures (mostly distributed indices) and apply this knowledge in IR-related projects.
The core topic of my PhD thesis is Dependency-Driven Analytics which is a new pattern in data analytics, where massive volumes of largely unstructured data are accessed through a compact and semantically-rich dependency graph.
Research Interests
Big Data; Machine Learning; Text Mining; Natural Language Processing; Information Retriveval;
Selected Papers
- Julia Proskurnia, Ruslan Mavlyutov, Carlos Castillo, Karl Aberer, and Philippe Cudré-Mauroux. “Efficient Document Filtering Using Vector Space Topic Expansion And
Pattern-Mining: The Case of Event Detection in Microposts.” In Proceedings of the 2017 ACM on Conference on Information and Knowledge
Management, CIKM 2017, Singapore, November 06 - 10, 2017, 457–66, 2017. Bibtex PDF
Efficient Document Filtering Using Vector Space Topic Expansion and
Pattern-Mining: The Case of Event Detection in Microposts
Julia Proskurnia, Ruslan Mavlyutov, Carlos Castillo, Karl Aberer, and Philippe Cudré-Mauroux. “Efficient Document Filtering Using Vector Space Topic Expansion And
Pattern-Mining: The Case of Event Detection in Microposts.” In Proceedings of the 2017 ACM on Conference on Information and Knowledge
Management, CIKM 2017, Singapore, November 06 - 10, 2017, 457–66, 2017.
@inproceedings{proskurnia2017cikm,
author = {Proskurnia, Julia and Mavlyutov, Ruslan and Castillo, Carlos and Aberer, Karl and Cudr{\'e}{-}Mauroux, Philippe},
title = {Efficient Document Filtering Using Vector Space Topic Expansion and
Pattern-Mining: The Case of Event Detection in Microposts},
booktitle = {Proceedings of the 2017 {ACM} on Conference on Information and Knowledge
Management, {CIKM} 2017, Singapore, November 06 - 10, 2017},
pages = {457--466},
year = {2017},
doi = {10.1145/3132847.3133016},
url = {https://exascale.info/assets/pdf/proskurnia2017cikm.pdf}
}
×
- Ruslan Mavlyutov, and Philippe Cudré-Mauroux. “Managing Big Interval Data with CINTIA the Checkpoint INTerval Array.” Transactions on Big Data (TBD), 2017. Bibtex PDF
Managing Big Interval Data with CINTIA the Checkpoint INTerval Array
Ruslan Mavlyutov, and Philippe Cudré-Mauroux. “Managing Big Interval Data with CINTIA the Checkpoint INTerval Array.” Transactions on Big Data (TBD), 2017.
@article{mavlyutov2017tbd,
title = {{Managing Big Interval Data with CINTIA the Checkpoint INTerval Array}},
author = {Mavlyutov, Ruslan and Cudr{\'e}-Mauroux, Philippe},
journal = {Transactions on Big Data (TBD)},
commentvolume = {7},
commentnumber = {3},
commentpages = {30},
year = {2017},
publisher = {IEEE},
url = {https://exascale.info/assets/pdf/mavlyutov2017tbd.pdf}
}
×
- Ruslan Mavlyutov, Carlo Curino, Boris Asipov, and Philippe Cudré-Mauroux. “Dependency-Driven Analytics: A Compass for Uncharted Data Oceans.” In Proceedings of the 8th Biennial Conference on Innovative Data Systems Research (CIDR 2017). Santa Cruz, USA, 2017. Bibtex Slides PDF
Dependency-Driven Analytics: A Compass for Uncharted Data Oceans
Ruslan Mavlyutov, Carlo Curino, Boris Asipov, and Philippe Cudré-Mauroux. “Dependency-Driven Analytics: A Compass for Uncharted Data Oceans.” In Proceedings of the 8th Biennial Conference on Innovative Data Systems Research (CIDR 2017). Santa Cruz, USA, 2017.
@inproceedings{2017mavlyutov:guider,
author = {Mavlyutov, Ruslan and Curino, Carlo and Asipov, Boris and Cudr{\'e}-Mauroux, Philippe},
title = {Dependency-Driven Analytics: A Compass for Uncharted Data Oceans},
booktitle = {Proceedings of the 8th Biennial Conference on Innovative Data Systems Research (CIDR 2017)},
year = {2017},
address = {Santa Cruz, USA},
note = {http://www.slideshare.net/eXascaleInfolab/dependencydriven-analytics-a-compass-for-uncharted-data-oceans},
url = {https://exascale.info/assets/pdf/cidr2017_dependency-driven-analytics.pdf}
}
×
- Sangeetha Abdu Jyothi, Carlo Curino, Ishai Menache, Shravan Matthur Narayanamurthy, Alexey Tumanov, Ruslan Mavlyutov, Jonathan Yaniv, et al. “Morpheus: towards Automated SLOs for Enterprise Clusters.” In Proceedings of OSDI’16: 12th USENIX Symposium on Operating Systems Design and Implementation, 117, 2016. Bibtex
Morpheus: towards automated SLOs for enterprise clusters
Sangeetha Abdu Jyothi, Carlo Curino, Ishai Menache, Shravan Matthur Narayanamurthy, Alexey Tumanov, Ruslan Mavlyutov, Jonathan Yaniv, et al. “Morpheus: towards Automated SLOs for Enterprise Clusters.” In Proceedings of OSDI’16: 12th USENIX Symposium on Operating Systems Design and Implementation, 117, 2016.
@inproceedings{jyothi2016morpheus,
title = {Morpheus: towards automated SLOs for enterprise clusters},
author = {Jyothi, Sangeetha Abdu and Curino, Carlo and Menache, Ishai and Narayanamurthy, Shravan Matthur and Tumanov, Alexey and Mavlyutov, Ruslan and Yaniv, Jonathan and Goiri, {\'I}{\~n}igo and Krishnan, Subru and Kulkarni, Janardhan and Rao, Sriram},
booktitle = {Proceedings of OSDI’16: 12th USENIX Symposium on Operating Systems Design and Implementation},
pages = {117},
year = {2016}
}
×
- Julia Proskurnia, Ruslan Mavlyutov, Roman Prokofyev, Karl Aberer, and Philippe Cudré-Mauroux. “Analyzing Large-Scale Public Campaigns on Twitter.” In International Conference on Social Informatics, 225–43. Springer, 2016. Bibtex
Analyzing Large-Scale Public Campaigns on Twitter
Julia Proskurnia, Ruslan Mavlyutov, Roman Prokofyev, Karl Aberer, and Philippe Cudré-Mauroux. “Analyzing Large-Scale Public Campaigns on Twitter.” In International Conference on Social Informatics, 225–43. Springer, 2016.
@inproceedings{proskurnia2016analyzing,
title = {Analyzing Large-Scale Public Campaigns on Twitter},
author = {Proskurnia, Julia and Mavlyutov, Ruslan and Prokofyev, Roman and Aberer, Karl and Cudr{\'e}-Mauroux, Philippe},
booktitle = {International Conference on Social Informatics},
pages = {225--243},
year = {2016},
organization = {Springer}
}
×
- Ruslan Mavlyutov, Marcin Wylot, and Philippe Cudré-Mauroux. “A Comparison of Data Structures to Manage URIs on the Web of Data.” In ESWC. Springer, 2015. Bibtex PDF
A Comparison of Data Structures to Manage URIs on the Web of Data
Ruslan Mavlyutov, Marcin Wylot, and Philippe Cudré-Mauroux. “A Comparison of Data Structures to Manage URIs on the Web of Data.” In ESWC. Springer, 2015.
@conference{uriencoding,
title = {A Comparison of Data Structures to Manage URIs on the Web of Data},
booktitle = {ESWC},
year = {2015},
publisher = {Springer},
organization = {Springer},
author = {Mavlyutov, Ruslan and Wylot, Marcin and Cudr{\'e}-Mauroux, Philippe},
url = {https://exascale.info/assets/pdf/uriencoding.pdf}
}
×
- Ruslan Mavlyutov, and Philippe Cudré-Mauroux. “CINTIA: A Distributed, Low-Latency Index for Big Interval Data.” In 2015 IEEE International Conference on Big Data, Big Data 2015, Santa
Clara, CA, USA, October 29 - November 1, 2015, 619–28, 2015. Bibtex PDF
CINTIA: A distributed, low-latency index for big interval data
Ruslan Mavlyutov, and Philippe Cudré-Mauroux. “CINTIA: A Distributed, Low-Latency Index for Big Interval Data.” In 2015 IEEE International Conference on Big Data, Big Data 2015, Santa
Clara, CA, USA, October 29 - November 1, 2015, 619–28, 2015.
@inproceedings{DBLP:conf/bigdataconf/MavlyutovC15,
author = {Mavlyutov, Ruslan and Cudr{\'e}-Mauroux, Philippe},
title = {{CINTIA:} {A} distributed, low-latency index for big interval data},
booktitle = {2015 {IEEE} International Conference on Big Data, Big Data 2015, Santa
Clara, CA, USA, October 29 - November 1, 2015},
pages = {619--628},
year = {2015},
crossref = {DBLP:conf/bigdataconf/2015},
url = {https://exascale.info/assets/pdf/CINTIA.pdf},
doi = {10.1109/BigData.2015.7363806},
timestamp = {Fri, 08 Jan 2016 13:52:01 +0100},
biburl = {http://dblp.uni-trier.de/rec/bib/conf/bigdataconf/MavlyutovC15},
bibsource = {dblp computer science bibliography, http://dblp.org}
}
×
- Alexander Butyaev, Ruslan Mavlyutov, Mathieu Blanchette, Philippe Cudré-Mauroux, and Jerome Waldispuhl. “A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data.” Nucleic Acids Research, 2015. Bibtex PDF
A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data
Alexander Butyaev, Ruslan Mavlyutov, Mathieu Blanchette, Philippe Cudré-Mauroux, and Jerome Waldispuhl. “A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data.” Nucleic Acids Research, 2015.
@article{304,
title = {A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data},
journal = {Nucleic Acids Research},
year = {2015},
author = {Butyaev, Alexander and Mavlyutov, Ruslan and Blanchette, Mathieu and Cudr{\'e}-Mauroux, Philippe and Waldispuhl, Jerome},
url = {https://exascale.info/assets/pdf/Nucleic2015_3DBG.pdf}
}
×
- Roman Prokofyev, Ruslan Mavlyutov, Martin Grund, Gianluca Demartini, and Philippe Cudré-Mauroux. “Correct Me If I’m Wrong: Fixing Grammatical Errors by Preposition Ranking.” In CIKM. CIKM ’14. Shanghai, China: ACM, 2014. Bibtex PDF
Correct Me If I’m Wrong: Fixing Grammatical Errors by Preposition Ranking
Roman Prokofyev, Ruslan Mavlyutov, Martin Grund, Gianluca Demartini, and Philippe Cudré-Mauroux. “Correct Me If I’m Wrong: Fixing Grammatical Errors by Preposition Ranking.” In CIKM. CIKM ’14. Shanghai, China: ACM, 2014.
@conference{Prokofyev:2014:CIKM,
title = {Correct Me If I{\textquoteright}m Wrong: Fixing Grammatical Errors by Preposition Ranking},
booktitle = {CIKM},
series = {CIKM {\textquoteright}14},
year = {2014},
publisher = {ACM},
organization = {ACM},
address = {Shanghai, China},
keywords = {n-gram statistics, pointwise mutual information, preposition correction, supervised learning},
doi = {10.1145/2661829.2661942},
author = {Prokofyev, Roman and Mavlyutov, Ruslan and Grund, Martin and Demartini, Gianluca and Cudr{\'e}-Mauroux, Philippe},
url = {https://exascale.info/assets/pdf/ir0991-prokofyev.pdf}
}
×
- Alexander Butyaev, Ruslan Mavlyutov, Mathieu Blanchette, Philippe Cudré-Mauroux, and Jerome Waldispuhl. “3DGB: A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data,” n.d. Bibtex
3DGB: A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data
Alexander Butyaev, Ruslan Mavlyutov, Mathieu Blanchette, Philippe Cudré-Mauroux, and Jerome Waldispuhl. “3DGB: A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data,” n.d.
@article{butyaev3dgb,
title = {3DGB: A Low-Latency, Big Database System and Browser for Storage, Querying and Visualization of 3D Genomic Data},
author = {Butyaev, Alexander and Mavlyutov, Ruslan and Blanchette, Mathieu and Cudr{\'e}-Mauroux, Philippe and Waldispuhl, Jerome}
}
×
Source of inspiration