no code implementations • 23 Feb 2024 • Sourya Basu, Suhas Lohit, Matthew Brand
Recent work by Finzi et al. (2021) directly solves the equivariance constraint for arbitrary matrix groups to obtain equivariant MLPs (EMLPs).
no code implementations • 14 Oct 2023 • Razan Baltaji, Sourya Basu, Lav R. Varshney
Inspired by the first design, we use the notion of the IS property to design a second efficient model-agnostic equivariant design for large product groups acting on a single input.
no code implementations • 15 Jul 2023 • Sourya Basu, Moulik Choraria, Lav R. Varshney
We find limits to the Transformer architecture for language modeling and show it has a universal prediction property in an information-theoretic sense.
no code implementations • 13 Oct 2022 • Sourya Basu, Prasanna Sattigeri, Karthikeyan Natesan Ramamurthy, Vijil Chenthamarakshan, Kush R. Varshney, Lav R. Varshney, Payel Das
We also provide a novel group-theoretic definition for fairness in NLG.
1 code implementation • 21 May 2022 • Sourya Basu, Jose Gallego-Posada, Francesco Viganò, James Rowbottom, Taco Cohen
Equivariance to symmetries has proven to be a powerful inductive bias in deep learning research.
1 code implementation • 10 Apr 2021 • Sourya Basu, Akshayaa Magesh, Harshit Yadav, Lav R. Varshney
We address these problems by proving a new group-theoretic result in the context of equivariant neural networks that shows that a network is equivariant to a large group if and only if it is equivariant to smaller groups from which it is constructed.
no code implementations • 8 Nov 2020 • Taha Ameen ur Rahman, Alton S. Barbehenn, Xinan Chen, Hassan Dbouk, James A. Douglas, Yuncong Geng, Ian George, John B. Harvill, Sung Woo Jeon, Kartik K. Kansal, Kiwook Lee, Kelly A. Levick, Bochao Li, Ziyue Li, Yashaswini Murthy, Adarsh Muthuveeru-Subramaniam, S. Yagiz Olmez, Matthew J. Tomei, Tanya Veeravalli, Xuechao Wang, Eric A. Wayman, Fan Wu, Peng Xu, Shen Yan, Heling Zhang, Yibo Zhang, Yifan Zhang, Yibo Zhao, Sourya Basu, Lav R. Varshney
Many information sources are not just sequences of distinguishable symbols but rather have invariances governed by alternative counting paradigms such as permutations, combinations, and partitions.
Information Theory Information Theory
2 code implementations • ICLR 2021 • Sourya Basu, Govardana Sachitanandam Ramachandran, Nitish Shirish Keskar, Lav R. Varshney
Experiments show that for low values of k and p in top-k and top-p sampling, perplexity drops significantly with generated text length, which is also correlated with excessive repetitions in the text (the boredom trap).
1 code implementation • NIPS Workshop CDNNRIA 2018 • Sourya Basu, Lav R. Varshney
Deep neural networks have shown incredible performance for inference tasks in a variety of domains.
1 code implementation • 9 Apr 2018 • Sourya Basu, Lav R. Varshney
Deep neural networks have shown incredible performance for inference tasks in a variety of domains.