Summer Intern
Data and AI in Microsoft Azure, Microsoft
Duration: 27 May 2019 - 27 July 2019
I worked on developing a serverless framework for smart city elements like smart traffic, smart parking, etc.
Data and AI in Microsoft Azure, Microsoft
Duration: 27 May 2019 - 27 July 2019
I worked on developing a serverless framework for smart city elements like smart traffic, smart parking, etc.
Data and AI in Microsoft Azure, Microsoft
Duration: 20 Jun 2020 - 6 Aug 2021
I facilitated solution building, cloud consumption, digital transformation and project orchestration primarily in Microsoft Azure Data and AI for customers in banking and finance
Amit Awekar's Lab, Indian Institute of Technology Guwahati
Duration: 15 Oct 2020 - 15 May 2021
I worked with Dr. Amit Awekar on representational learning in word embedding algorithms and multilinguality
Wei Xu's lab, Georgia Tech
Duration: 10 Aug 2021 - 10 Dec 2021
I worked on natural language processing in social media, specifically focussing on PTSD patients
Pennebaker Language Lab, The University of Texas at Austin
Duration: 20 May 2022 - Current
I am working with Dr. James W. Pennebaker on natural language understanding in social media, specifically analyzing psychological, linguistic and topical dimensions of data
Multilingual Technologies (MLT) Lab, German Research Center for Artificial Intelligence (DFKI) and Universität des Saarlandes (UdS)
Duration: 15 Aug 2022 - Current
I am working with Dr. Josef van Genabith and Dr. Cristina Espãna Bonet on multilinguality, intepretability and model analysis of neural networks in machine translation.
Love travelling and spending time outdoors
This is my first time in Europe, so hit me up if you have any suggestions about places that I could visit! I stay in Saarbrücken, which is in southwestern Germany.
All genres :P
Singing is my hobby, but I am also trained in North Indian (Hindustani) Classical Music (for 5 years - completed Visharad-II which is considered as masters in Indian Classical Music). I sing and listen to almost all genres, but my personal favorites are Classical, Pop, and R&B.
Blogging about NLP, ML and things that interests me!
I occasionally blog on Medium and Wordpress, although my Wordpress site has not been updated since 2020.:(
Short description of portfolio item number 1
Short description of portfolio item number 2
We explored data manipulation techniques, explainability, prompt engineering, and unsupervised approaches (using attention matrices) to detect and mitigate hallucinations in Neural Machine Translation. (Repo Link)
Cognitive biases impact the decision-making of patients with a chronic illness. We analyzed the presence of language patterns and psycholinguistic markers characteristic of cognitive biases within Reddit posts by individuals who have self-disclosed a chronic illness using visual, interrupted time series and n-gram analyses of associated LIWC dimensions. (Report Link)
A Visual Question Answering System to help the visually impaired with navigation. We explored large pre-trained models (VGGNet and BERT) for images and text to answer queries by the visually impaired. (Link to project site)
We analyzed different distance metrics including static and learned distance metrics in Prototypical Networks. (Link to paper)
We analyzed patterns in mental discourse of the student community in Reddit. We performed topic modeling and time series modeling and analysis that quantifies relationships between the qualitative topics. (Paper Link)
We used social media data from Reddit to understand
audience’s perception about biases in movies.
(Presentation Link)
Published in International Journal of Multimedia Data Engineering and Management (IJMDEM), 2020
We introduce a technique called KTRICT, which uses random projection based indexing and improves the performance of the CBIR system by significantly reducing the overall retrieval time.
Recommended citation: Badal Soni, Angana Borah, Pidugu Naga Lakshmi Sowgandhi, Pramod Sarma and Ermyas Fekadu Shiferaw. (2020). "Are Word Embedding Methods Stable and Should We Care About It?" International Journal of Multimedia Data Engineering and Management (IJMDEM) . 11(2).
Published in Computer Networks Journal, 2021
A survey paper which presents a detailed discussion on the role of Bloom Filter in implementing Named Data Netowkring, a future internet architecture which handles billions of requests by modifying the existing IP architecture.
Recommended citation: Sabuzima Nayak*, Ripon Patgiri*, Angana Borah* " A survey on the roles of Bloom Filter in implementation of the Named Data Networking. Volume 196, 4 September 2021, 108232.
Published in ACM HyperText, 2021
The central idea of this paper is to explore the stability measurement of WEMs using intrinsic evaluation based on word similarity and observe the effect of stability on two downstream tasks like Clustering and Fairness evaluation.
Recommended citation: Angana Borah, Manash Pratim Barman, Amit C Awekar. (2021). "Are Word Embedding Methods Stable and Should We Care About It?" In Proceedings of the 32nd ACM Conference on Hypertext and Social Media . (pp. 45-55).
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.