Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.

Pages

Posts

Blog Post number 4

less than 1 minute read

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blogging

less than 1 minute read

Sometimes I blog about ML, NLP, and other topics on Medium and WordPress (although my WordPress site has not been updated since 2020 :( ).

Blog Post number 3

less than 1 minute read

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

experience

Summer Intern

Data and AI in Microsoft Azure, Microsoft

Duration: 27 May 2019 - 27 July 2019

I worked on developing a serverless framework for smart city elements like smart traffic, smart parking, etc.

Technical Account Manager

Data and AI in Microsoft Azure, Microsoft

Duration: 20 Jun 2020 - 6 Aug 2021

I facilitated solution building, cloud consumption, digital transformation, and project orchestration, primarily in Microsoft Azure Data and AI, for customers in banking and finance.

NLP Research Intern

Amit Awekar's Lab, Indian Institute of Technology Guwahati

Duration: 15 Oct 2020 - 15 May 2021

I worked with Dr. Amit Awekar on representation learning in word embedding algorithms and multilinguality.

Graduate Researcher

Wei Xu's lab, Georgia Tech

Duration: 10 Aug 2021 - 10 Dec 2021

I worked on natural language processing in social media, specifically focusing on PTSD patients.

Summer Research Intern

Pennebaker Language Lab, The University of Texas at Austin

Duration: 20 May 2022 - Current

I am working with Dr. James W. Pennebaker on natural language understanding in social media, specifically analyzing psychological, linguistic, and topical dimensions of data.

Fall Research Intern

Multilingual Technologies (MLT) Lab, German Research Center for Artificial Intelligence (DFKI) and Universität des Saarlandes (UdS)

Duration: 15 Aug 2022 - Current

I am working with Dr. Josef van Genabith and Dr. Cristina España-Bonet on multilinguality, interpretability, and model analysis of neural networks in machine translation.

misc

Travelling and Outdoors

Love travelling and spending time outdoors

This is my first time in Europe, so hit me up if you have any suggestions about places that I could visit! I stay in Saarbrücken, which is in southwestern Germany.

Singing

All genres :P

Singing is my hobby, but I also trained in North Indian (Hindustani) Classical Music for 5 years and completed Visharad-II, which is considered a master's in Indian Classical Music. I sing and listen to almost all genres, but my personal favorites are Classical, Pop, and R&B.

Technical Blogging

Blogging about NLP, ML, and things that interest me!

I occasionally blog on Medium and WordPress, although my WordPress site has not been updated since 2020 :(

portfolio

projects

Investigating the Impact of Chronic Illness on Cognitive Biases

Cognitive biases impact the decision-making of patients with a chronic illness. We analyzed the presence of language patterns and psycholinguistic markers characteristic of cognitive biases within Reddit posts by individuals who have self-disclosed a chronic illness using visual, interrupted time series and n-gram analyses of associated LIWC dimensions. (Report Link)




publications

KTRICT A KAZE Feature Extraction: Tree and Random Projection Indexing-Based CBIR Technique

Published in International Journal of Multimedia Data Engineering and Management (IJMDEM), 2020

We introduce a technique called KTRICT, which uses random projection based indexing and improves the performance of the CBIR system by significantly reducing the overall retrieval time.

Recommended citation: Badal Soni, Angana Borah, Pidugu Naga Lakshmi Sowgandhi, Pramod Sarma and Ermyas Fekadu Shiferaw. (2020). "KTRICT A KAZE Feature Extraction: Tree and Random Projection Indexing-Based CBIR Technique." International Journal of Multimedia Data Engineering and Management (IJMDEM). 11(2).

A survey on the roles of Bloom Filter in implementation of the Named Data Networking

Published in Computer Networks Journal, 2021

A survey paper that presents a detailed discussion of the role of Bloom Filter in implementing Named Data Networking, a future internet architecture that handles billions of requests by modifying the existing IP architecture.

Recommended citation: Sabuzima Nayak*, Ripon Patgiri*, Angana Borah*. (2021). "A survey on the roles of Bloom Filter in implementation of the Named Data Networking." Computer Networks. Volume 196, 4 September 2021, 108232.

Are Word Embedding Methods Stable and Should We Care About It?

Published in ACM HyperText, 2021

The central idea of this paper is to measure the stability of word embedding methods (WEMs) using intrinsic evaluation based on word similarity, and to observe the effect of stability on two downstream tasks: clustering and fairness evaluation.

Recommended citation: Angana Borah, Manash Pratim Barman, Amit C Awekar. (2021). "Are Word Embedding Methods Stable and Should We Care About It?" In Proceedings of the 32nd ACM Conference on Hypertext and Social Media (pp. 45-55).

talks

teaching

Teaching experience 1

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.