Ohad Shamir, Sivan Sabato, et al.
Theoretical Computer Science
We present an approach for cataloging an organization's skill assets based on electronic communications. Our approach trains classifiers using messages from skill-related discussion groups and then applies those classifiers to a different distribution of person-related e-mail messages. We present a general framework, called cross training, for addressing such discrepancies between the training and test distributions. We outline two instances of the general cross-training problem, develop algorithms for each, and empirically demonstrate the efficacy of our solution in the skill-mining context.
Ohad Shamir, Sivan Sabato, et al.
Theoretical Computer Science
Preeti Malakar, Thomas George, et al.
SC 2012
Leo Liberti, James Ostrowski
Journal of Global Optimization
Thomas M. Cheng
IT Professional