Publications | Suriya Gunasekar

Suriya Gunasekar, Varun Chandrasekaran, Jerry Li, Mert Yuksekgonul, Rahee Ghosh Peshawaria, Ranjita Naik, Besmira Nushi Marah I Abdin. KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval. arXiv preprint, 2023.

Mert Yuksekgonul, Varun Chandrasekaran, Erik Jones, Suriya Gunasekar, Ranjita Naik, Hamid Palangi, Ece Kamar, Besmira Nushi. Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models. arXiv preprint, 2023.

Yuanzhi Li, Sebastien Bubeck, Ronen Eldan, Allie Del Giorno, Suriya Gunasekar, Yin Tat Lee. Textbooks Are All You Need II: phi-1.5 technical report. arXiv preprint, 2023.

Suriya Gunasekar, Yi Zhang, Jyoti Aneja, Caio César Teodoro Mendes, Allie Del Giorno, Sivakanth Gopi, Mojan Javaheripi, Piero Kauffmann, Gustavo de Rosa, Olli Saarikivi, Adil Salim, Shital Shah, Harkirat Singh Behl, Xin Wang, Sebastien Bubeck, Ronen Kalai, Adam Tauman Eldan, Yin Tat Lee, Yuanzhi Li. Textbooks Are All You Need. arXiv preprint, 2023.

Mathieu Even, Scott Pesme, Suriya Gunasekar, Nicolas Flammarion. (S) GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability. Advances in Neural Information Processing Systems (NeurIPS), 2023.

Ananya Kumar, Ruoqi Shen, Sebastien Bubeck, Suriya Gunasekar. How to Fine-Tune Vision Models with SGD. arXiv preprint, 2022.

Yi Zhang, Arturs Backurs, Sebastien Bubeck, Ronen Eldan, Suriya Gunasekar, Tal Wagner. Unveiling Transformers with LEGO: a synthetic reasoning task. arXiv preprint, 2022.

Suriya Gunasekar. Generalization to translation shifts: a study in architectures and augmentations. arXiv preprint, 2022.

Yunhao Ge, Harkirat Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet. Neural-Sim: Learning to Generate Training Data with NeRF. European Conference on Computer Vision (ECCV), 2022.

Meena Jagadeesan, Ilya Razenshteyn, Suriya Gunasekar. Inductive bias of multi-channel linear convolutional networks with bounded weight norm. Conference on Learning Theory (COLT), 2022.

Ruoqi Shen, Sebastien Bubeck, Suriya Gunasekar. Data Augmentation as Feature Manipulation. International Conference on Machine Learning (ICML), 2022.

Yiding Jiang, Parth Natekar, Manik Sharma, Sumukh K Aithal, Dhruva Kashyap, Natarajan Subramanyam, Carlos Lassance, Daniel M Roy, Gintare Karolina Dziugaite, Suriya Gunasekar, others. Methods and Analysis of The First Competition in Predicting Generalization of Deep Learning. NeurIPS 2020 Competition and Demonstration Track, 2021.

Suriya Gunasekar, Blake Woodworth, Nathan Srebro. Mirrorless mirror descent: A natural derivation of mirror descent. International Conference on Artificial Intelligence and Statistics (AISTATS), 2021.

Xiaoxia Wu, Edgar Dobriban, Tongzheng Ren, Shanshan Wu, Zhiyuan Li, Suriya Gunasekar, Rachel Ward, Qiang Liu. Implicit regularization and convergence for weight normalization. Neural Information Processing Systems (NeurIPS), 2020.

Edward Moroshko, Blake E Woodworth, Suriya Gunasekar, Jason D Lee, Nati Srebro, Daniel Soudry. Implicit bias in deep linear classification: Initialization scale vs training accuracy. Neural Information Processing Systems (NeurIPS), 2020.

Blake Woodworth, Suriya Gunasekar, Jason D Lee, Edward Moroshko, Pedro Savarese, Itay Golan, Daniel Soudry, Nathan Srebro. Kernel and Rich Regimes in Overparametrized Models. Conference on Learning Theory (COLT), 2020.

Raman Arora, Sanjeev Arora, Joan Bruna, Nadav Cohen, Simon Du, Rong Ge, Suriya Gunasekar, Chi Jin, Jason Lee, Tengyu Ma, others. Theory of deep learning. Princeton Univ. Princeton, NJ, 2019.

Mor Shpigel Nacson, Jason Lee, Suriya Gunasekar, Pedro HP Savarese, Nathan Srebro, Daniel Soudry. Convergence of gradient descent on separable data. International Conference on Artificial Intelligence and Statistics (AISTATS), 2019.

Mor Shpigel Nacson, Suriya Gunasekar, Jason D Lee, Nathan Srebro, Daniel Soudry. Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models. International Conference on Machine Learning (ICML), 2019.

Avrim Blum, Suriya Gunasekar, Thodoris Lykouris, Nati Srebro. On preserving non-discrimination when combining expert advice. Neural Information Processing Systems (NeurIPS), 2018.

Suriya Gunasekar, Jason D Lee, Daniel Soudry, Nati Srebro. Implicit bias of gradient descent on linear convolutional networks. Neural Information Processing Systems (NeurIPS), 2018.

Suriya Gunasekar, Jason Lee, Daniel Soudry, Nathan Srebro. Characterizing Implicit Bias in Terms of Optimization Geometry. International Conference on Machine Learning (ICML), 2018.

Daniel Soudry, Elad Hoffer, Mor Shpigel Nacson, Suriya Gunasekar, Nathan Srebro. The Implicit Bias of Gradient Descent on Separable Data. Journal of Machine Learning Research (JMLR), 2018.

Suriya Gunasekar, Blake E Woodworth, Srinadh Bhojanapalli, Behnam Neyshabur, Nati Srebro. Implicit regularization in matrix factorization. Neural Information Processing Systems (NeurIPS), 2017.

Blake Woodworth, Suriya Gunasekar, Mesrob I Ohannessian, Nathan Srebro. Learning Non-Discriminatory Predictors. Conference on Learning Theory (COLT), 2017.

Suriya Gunasekar, Oluwasanmi O Koyejo, Joydeep Ghosh. Preference Completion from Partial Rankings. Neural Information Processing Systems (NeurIPS), 2016.

Suriya Gunasekar. Mining structured matrices in high dimensions. 2016.

Shalmali Joshi, Suriya Gunasekar, David Sontag, Ghosh Joydeep. Identifiable phenotyping using constrained non-negative matrix factorization. Machine Learning for Healthcare Conference (MLHC), 2016.

Suriya Gunasekar, Joyce C Ho, Joydeep Ghosh, Stephanie Kreml, Abel N Kho, Joshua C Denny, Bradley A Malin, Jimeng Sun. Phenotyping using Structured Collective Matrix Factorization of Multi--source EHR Data. arXiv preprint, 2016.

Suriya Gunasekar, Arindam Banerjee, Joydeep Ghosh. Unified view of matrix completion under general structural constraints. Neural Information Processing Systems (NeurIPS), 2015.

Suriya Gunasekar, Makoto Yamada, Dawei Yin, Yi Chang. Consistent collective matrix completion under joint low rank structure. Artificial Intelligence and Statistics (AISTATS), 2015.

Suriya Gunasekar, Joydeep Ghosh, Alan C Bovik. Face detection on distorted images augmented by perceptual quality-aware features. IEEE transactions on information forensics and security, 2014.

Suriya Gunasekar, Pradeep Ravikumar, Joydeep Ghosh. Exponential family matrix completion under structural constraints. International Conference on Machine Learning (ICML), 2014.

Suriya Gunasekar, Ayan Acharya, Neeraj Gaur, Joydeep Ghosh. Noisy matrix completion using alternating minimization. Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML/PKDD), 2013.

Sindhu Raghavan, Suriya Gunasekar, Joydeep Ghosh. Review quality aware collaborative filtering. ACM conference on Recommender systems (RecSys), 2012.