Home

I work as a Research Scientist with American Express AI Labs, Bangalore, India. At AMEX, we work on solving problems in FinTech area using Natural Language Processing and Machine Learning.

Previously, I was working as a PostDoctoral Researcher at School of Computing, Dublin City University, pursuing research in Computer Science and Engineering.

My areas of interest are Information Extraction and Retrieval, User Search Behavior, Natural Language Processing, Machine Learning, Sentiment Analysis and Deep Learning.

I worked in the ADAPT Centre at DCU in Dublin, Ireland.

My PhD supervisors were Dr. Gareth Jones and Dr. Jennifer Foster



"Share your knowledge.
It is a way to achieve immortality."
-- by Dalai Lama

Recent Updates and Notifications

Research Publications

A small list of published papers:

  1. Piyush Arora, and Gareth JF Jones "DCU at the NTCIR-14 OpenLiveQ-2 Task" (NTCIR, 2019)

  2. David Azcona, Piyush Arora, Hsiao I-Han, Alan Smeaton "user2code2vec: embeddings for profiling students based on distributional representations of source code" (LAK, 2019)

  3. Piyush Arora, Jennifer Foster and Gareth JF Jones "My PhD Thesis: Promoting user engagement and learning in search tasks by effective document representation" (2018, Dublin City University, Irelad)

  4. Piyush Arora and Gareth JF Jones "Identifying Useful and Important Information within Retrieved Documents" (CHIIR, 2017)

  5. Piyush Arora, Debasis Ganguly, Gareth JF Jones "Nearest Neighbour based Transformation Functions for Text Classification: A case study with StackOverflow" (ICTIR, 2016)

  6. Piyush Arora and Gareth JF Jones "Promoting User Engagement and Learning in Search Tasks By Effective Document Representation" (Search as Learning workshop, SIGIR 2016)

  7. Chris Hokamp and Piyush Arora,"DCU-SEManiacs at SemEval-2016 Task 1: Synthetic Paragram Embeddings for Semantic Textual Similarity" (SEMEVAL 2016, in association with NAACL 2016)

  8. Piyush Arora "Promoting User Engagement and Learning in Amorphous Search Tasks" (SIGIR 2015 Doctoral Consortium, Santiago Chile)

  9. Piyush Arora, Debasis Ganguly, Gareth JF Jones "The Good, the Bad and their Kins: Identifying Questions with Negative Scores in StackOverflow" (FAB 2015, in conjunction with ASONAM 2015)

  10. Piyush Arora, Chris Hokam, Jennifer Foster, Gareth JF Jones "DCU: Using Distributional Semantics and Domain Adaptation for the Semantic Textual Similarity SemEval-2015 Task 2" (SEMEVAL 2015, in association with NAACL 2015)

  11. Joachim Wagner, Piyush Arora, Santiago Cortes, Utsab Barman, Dasha Bogdanova, Jennifer Foster, Lamia Tounsi "DCU: Aspect-based Polarity Classification for SemEval Task 4" (SEMEVAL 2014, in association with COLING 2014)

Check full list of papers at: Google Scholar's Profile

Served on the programme committe of NAACL, ACL, ACM TALIP, EMNLP, ICON, SEMEVAL, ASICS.
I also served as the chair and editor of the MediaEval 2018 proceedings.

Courses

Some of the Advanced courses I have taken in IIIT-Hyderabad include the following:
  • Machine Learning
  • Information Retrieval and Extraction
  • Pattern Recognition
  • Data Warehousing and Data Mining
  • Artificial Intelligence
  • Natural Language Processing
  • NLP Applications

Some of the Interesting courses I have taken in IIIT-Hyderabad include the following:
  • Introduction to Cognitive Science
  • Intelligence: A Technology of Mind
  • Social and Technical Innovation
  • Gandhi and India
  • Readings from Hindi Literature
  • Corporate Strategy (Audited)
Apart from computer science and engineering, I like reading and diving into philosophy and humanities literature.

Conferences and Workshops attended

Some of the conferences, workshops and data meetups which I have attended:
  • 2019
    • Presented our paper on "DCU at the NTCIR-14 OpenLiveQ-2 Task" at NTCIR 14 held at Tokyo, Japan.
    • Gave a seminar talk at the National Institute of Informatics in Tokyo, Japan on 14th June, 2019 on our recent work in Neural IR and Multimodal retrieval at the ADAPT center.
  • 2018
    • Presented our work on "Semantic Search" at the Scientific Meeting held in the Trinity College, Dublin.
  • 2017
    • Presented My PhD work at HLF 2017
    • Presented our industrial collaboration work with Wolters Kluwer at the ADAPT Industry Showcase held at croke park, in Dublin, Ireland.
    • Attended and presented our paper at CHIIR 2017
    • Attended and presented our paper at SCST 2017 workshop, at CHIIR 2017: Supporting Complex Search Tasks
  • 2016
    • Presented a talk on applied NLP, at PyCon Ireland 2016 held on Sat 5th - Sun 6th November in Dublin
    • Presented our work at ADAPT Science day and ADAPT 2016 Industry showcase held at croke park, in Dublin, Ireland.
    • Presented a talk on our recent work on "Question Quality Prediction using Nearest Neighbourhood based Transformation Functions" at Dublin NLP Meetup
    • Attended SIGIR 2016 and presented our paper at SIGIR 2016, workshop: "Search As Learning"
    • Attended CHI2016 open house at Facebook Headquarters, Menlo Park
  • 2015
    • Attended Deep Learning Winter School, DL4MT 2015 held in Dublin, Ireland
    • Attended and Presented our work at SIGIR 2015, Doctor Consortium
    • Attended and Presented our work at FAB 2015 held in conjunction with ASONAM 2015
    • Attended and Presented our work at Semeval 2015 held in association with NAACL 2015
    • Participated in HackDCU 2015 and short listed among top 7 finalist out of 12 teams
  • 2014
    • Participated in DataKind Dublin Hackathon 2014: "Working for charities"
    • Attended WebSummit 2014 held in Dublin, Ireland
    • Attended PyCon 2014 held in Dublin, Ireland
    • Presented Brazilator sytem and poster at CNGL booth at COLING 2014 held in Dublin and at CNGL Industry Showcase held in Dublin
    • Attended COLING 2014 held in Dublin, Ireland
    • Attended Machine Learning School, LXMLS-2014 held in Lisbon, Portugal
  • 2013
  • 2012
  • 2011
  • 2010
  • 2009

Ongoing Projects

  • Cross Language MultiModal (Audio and Text) Retrieval
  • Exploring LearningToRank and Hybrid models for Information Retrieval
  • Document Summarization using Deep Learning

Completed (Ph.D Projects)

  • Effective Snippet generation for search engine result page (SERP's)
  • Analyzing user search behavior for complex search tasks
  • Measuring changes in user learning and topical knowledge in search tasks
  • Information need based thorough documents analysis
  • Exploring document level semantics for question quality prediction
  • Question Quality Prediction in Knowledge Forums: StackOverflow
  • Irish budget-2015 data analysis for RTE News: details
  • Semantic Similarity of Sentences and Documents
  • Investigating translation services (Google and Bing) for cross language information retrieval.
  • Live tweet translation and sentiment analysis system: BRAZILATOR in collaboration with Microsoft team.
  • Aspect Based Polarity Detection
  • Cross Language !ndian News Story Search
----Source code and link to the datasets for most of the completed projects is available on request.----

Past Projects (B.Tech and Masters Projects)

  • Mining Intent in Question Answer Forums
  • Sentiment Analysis on user-generated content focussing on product reviews and tweets
  • Subjective Lexicon Generation For Indian Languages
  • Unifying Google, Facebook and Twitter social data for enriched information about entities
  • WikiSearch: Mini Search Engine on Wikipedia
  • Transfer Grammar Engine For Sampark System
  • Urdu-Hindi Transliteration

How is life as a PhD student?

Some people describe it as a roller coaster ride and can be a nightmare for few. But I believe it's easy to comment in the end, when you are done to look back but hard to decipher when you are in between. For me, it has been a nice experience till now let see how things unfold in future. A nice detailed blog discussing different aspects about PhD: A Survival Guide to PhD.

What kind of work is expected in Ph.D?

A nice way of describing same visually: What is Ph.D ?.

How to learn basics of Information retrieval?

I will advice to follow two main books:Introduction to Information Retrieval and Modern Information Retrieval. In my opinion understand and try to question yourself as the basics remain same but there is a lot evolving both in theoretical and practical aspects.

Where can I learn more on User Search Behavior and complex search task?

I'm new to this area but some very useful and important pieces of recent papers, discussion can be checked at:
There are some nice groups working in this area from long time. Some starting pointers: Microsoft Research, Group at University of North Carolina, Group at Yahoo labs, Group at University of Tampere, Group at University of Glasgow.

Some standard benchmark competitions:

If you want to gain practical knowledge, start playing around with so many freely available standard benchmark datasets from different platforms such as: TREC, CLEF, NTCIR , FIRE, SEMEVAL etc.

Achievements

  • Our industrial collaborative project on Enhanced Supply Chain Performance & Risk Management Platform, won DCU INVENT 2018 commercialisaton award, in ICT category.
  • Our industrial collaborative project on Legal Semantic Search, won DCU INVENT 2017 commercialisaton award, in ICT category.
  • I was selected as one of the 200 young researchers to attend and participate in the 5th Heidelberg Lauretae Forum in September 2017, at Heidelberg in Germany
  • I was selected as one of the SIGIR student ambassadors to represent SIGIR at the 50 Years of the ACM Turing Award Celebration held in June 2017.

  • Participated and achieved good results (rank 14/45) and (rank 3/10) for Semantic Textual Similarity shared task for English and Spanish language respectively at SEMEVAL-2016
  • Our project idea StudentsForResearch got selected for UStart, 2015 startup accelerator programme run by DCU Ryan Academy
  • Participated and achieved average results (rank 26/74) for Semantic Textual Similarity shared task at SEMEVAL-2015
  • Came first in Aspect Based Polarity Detection shared task at SEMEVAL-2014
  • Came second in Cross Language Indian News Story Search shared task at FIRE-2013
  • Awarded with Research Award for academic year 2010-11
  • Dean Merit List (Spring 2010), Merit List (Monsoon 2010, Spring 2011)

Extra Curricular Activities

  • Represented DCU Squash and Volleyball Team at Irish Intervarsitities 2016 (definitely had some wondeful time in my final year with DCU team)
  • Runner-up in Badminton SSI-League 2014 and Plate winners in Irish Volleyball Intervarsities 2014: details
  • Banyan Award for dedicated services throughout the BTech life (July'07 - May'11)
  • Won prizes at various sports and extra curricular activities

Work Experience

Collaborative applied projects with following industrial partners

Startup Project

  • Research Engineer for our startup project funded by Ustart, startup accelerator programme run by DCU Ryan Academy (June'15-Sept'15)

At Dublin City University

  • PostDoctoral Researcher at ADAPT, Dublin City University, Ireland    (Sep'18-at present)
  • Research Assistant at ADAPT, Dublin City University, Ireland    (May'17- Aug'18)
  • Research Internship at CNGL, Dublin City University, Ireland    (Aug'12-Nov'12)
  • Tutor for Course Problem Solving, Creativity, and Design, (Sep'15-Dec'15, Sep'14-Dec'14, Sep'13-Dec'13)
  • Lab tutor for Course Search technologies, (Sep'16-Dec'16, Sep'15-Dec'15)
  • Tutor for Course Business Applications, (Jan'16-May'16, Jan'15-May'15)
  • Tutor for Course Data Structures and Algorithms, (Sep'14-Dec'14)
  • Tutor for Course Introduction to Java Programming, (Sep'13-May'14)

At IIIT-Hyderabad

  • Research Assistant at Language Technologies Research Center, (Jan'10 - July'12)
  • Summer Internship at Language Technologies Research Center, May-July('09 and '10)
  • Teaching Assistant for Course Information Retrieval and Extraction, (Jan'12 -May'12)
  • Teaching Assistant for Course Corporate Strategy, (Aug'11 - Dec'11)

Other responsibilities and positions held

At Dublin City University

  • Health and saftey officer for DCU Squash club (2016-2017)
  • Equipment officer for DCU Volleyball club (2015-2016)
  • Member of DCU Volleyball, Squash and Badminton Club (At present and in the past)

At IIIT-Hyderabad

  • Campus Life Secretary for 2 consecutive academic years (2009-2011).
  • Member of students parliament, IIIT-Hyderabad (2009-2011).
  • Coordinator for Fine-Arts Club (Monsoon-2009).
  • Coordinator for Design Team and Kalakshetra in Felicity 2010.
  • Campus Ambassador for Teach For India (2010-2011).
  • Coordinator and Member of Various Social Groups->

About Myself

I am a fun loving person. I am interested in learning and exploring more on how we can use technology to bring changes at the root level, how can we make better societies and ecosystem with the use of technology. I believe "Guided Technology" can play an important role in the path towards global sustainable world.

I like playing badminton, squash in my free time. I love going for long walks exploring the beautful nature, the precious gift that has been given to all of us. I love visiting new places, meeting new people and making friends. How can I forget the important part- I love listening music it's very dear to my heart.