Activity–weight duality in feed-forward neural networks reveals two co-determinants for generalization
- Yu Feng
- Wei Zhang
- et al.
- 2023
- Nature Machine Intelligence
Paper
I received my Ph.D. from the University of Wisconsin–Madison, where my dissertation focused on improving the reliability of concurrent software through an effect-oriented approach.
I joined IBM Research as a Research Staff Member in August 2013. Since then, my research has spanned programming languages, distributed deep learning, systems for machine learning, and AI for Code. My publication record is available on Google Scholar: https://scholar.google.com/citations?user=DJMSA3YAAAAJ&hl=en
Some of my research highlights include:
At IBM Research, I have worked across several research groups: