About Me
I graduated from the Department of Computer Science at Cornell University and accepted a position with Google NYC. My PhD adviser was Thorsten Joachims. My research explores how document collections develop -- specifically, by detecting idea flows between documents by looking at which documents influence each other, and by detecting where in documents ideas originate. These (entirely) text-based methods can be used to detect the most influential documents and authors in document collections. Applications exist in many domains of text documents, such as news articles, web pages, research publications, digital libraries, blogs, e-mail, and so on.
Research Interests
- Data Mining, Machine Learning, Text Mining, Temporal Data Mining
- Sponsored Search, Information Retrieval, Multimedia Retrieval
My Resume
There are two versions -- a one-page version and a full-length version.
Publications
- Benyah Shaparenko, Ozgur Cetin, and Rukmini Iyer. Text Features for Sponsored Search Click Prediction. In submission.
- Benyah Shaparenko and Thorsten Joachims. Identifying the Original Contribution of a Document via Language Modeling. Paper in submission. Poster accepted at SIGIR-2009.
- Benyah Shaparenko and Thorsten Joachims. Information Genealogy: Uncovering the Flow of Ideas in Non-hyperlinked Document Databases. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp.619-628, San Jose, CA, USA, 2007.
- Benyah Shaparenko, Rich Caruana, Johannes Gehrke, and Thorsten Joachims. Identifying Temporal Patterns and Key Players in Document Collections. In Proceedings of the IEEE ICDM Workshop on Temporal Data Mining: Algorithms, Theory and Applications (TDM-05), pp. 165-174, Houston, TX, USA, 2005.
- A. M. MacEachren, G. Cai, R. Sharma, I. Rauschert, I. Brewer, L. Bolelli, B. Shaparenko, S. Fuhrmann, and H. Wang, Enabling Collaborative Geoinformation Access and Decision-Making through a Natural, Multimodal Interface, International Journal of Geographical Information Science, 19(3), pp. 293-317, 2005.
- B. Shaparenko and J. Hannan. Private State in Object-Oriented Programming Languages. Honors Thesis. The Pennsylvania State University, 2003. (Also available as a PS)
- J. Privratska, B. Shaparenko, D. Litvin, and V. Janovec. Magnetic Point Group Symmetries of Spontaneously Polarized and/or Magnetized Domain Walls, Ferroelectrics, 269, pp. 39-44, 2002.
- B. Shaparenko, J. Schlessman, and D. B. Litvin. Domain Average Engineering in Ferroics, Ferroelectrics, 269, pp. 9-14, 2002.
Education
- Cornell University, M.S., Ph.D. in Computer Science. Ithaca, NY. Research in data mining. Thesis on exploring how ideas originate and flow in collections of text documents.
- Penn State University, B.S. in Computer Science. State College, PA. Thesis on private state in object-oriented languages.
- Dalet School, high school. Bethel, PA. School of the Assemblies of Yahweh.
|