Humanitarian & Social Informatics Lab, GMU
NSF III Small Collaborative Research Award # 1815459 & 1814958 Project: III: Small: Collaborative Research: Summarizing Heterogeneous Crowdsourced & Web Streams Using Uncertain Concept Graphs (Collaborative Grant Page)
Sponsor: National Science Foundation
Team: George Mason University (GMU), Vanderbilt University (VU)
Project & GMU Principal Investigator: Dr. Hemant Purohit
GMU Co-Principal Investigator: Dr. Huzefa Rangwala
VU Principal Investigator: Dr. Abhishek Dubey
VU Co-Principal Investigator: Dr. Gautam Biswas

About


Ubiquitous access to mobile and web technologies enables the public to share valuable information about their surroundings anywhere and anytime. For example, during an emergency or crisis people report needs from affected areas via social media as an alternative to the traditional 911 calls. This can be valuable information for a range of emergency service officials. However, the utilization of this data poses several computational challenges as it is generated in real time, is heterogeneous, highly unstructured, redundant, and sometimes unreliable. The project investigates new summarization approaches to handle noisy, unstructured data streams from multiple web sources in real time while accounting for the possibility of untrustworthy information, so that they can be fed into decision support systems of public services in a structured and machine-readable format. In addition, the project develops and validates robust decision support systems for allocating critical resources to needed areas based on the structured summary reports. The evaluation plan includes collaboration with emergency responders and the communities they serve. The broader impacts of this research include the design of a generic methodology to extract, integrate, and summarize structured information from big data streams on the web for helping public services of future smart cities. The research team plans to share simulated datasets with an open source system for real-time decision support during emergency response exercises. This can assist in workforce training and also, help design novel educational projects of data science for social good.

As such, we have worked on three sub-goals in the project.
a.) Formulate a representation of the uncertainties in incident reporting data and other environment reporting data that accompany an emergency incident event and construct an inference model to determine the likelihood of resource requirement in a region at a given time based on current reporting data. For this sub-goal, we aim to build a novel uncertain concept graph (UCG), which is a novel knowledge representation for structured summarization of unstructured data streams. It models key concepts of an application domain identified from domain knowledge ontologies as nodes, followed by modeling inference of relationships between concept nodes using information extraction over heterogeneous unstructured crowdsourced reporting data (Waze and Twitter).
b.) Design and evaluate a Bayesian analysis approach to infer relations between concepts in the uncertain concept graph in real-time, where the output of the analysis is a real-time update of resource requirements in a region over time. For this sub-goal, we aim to develop a novel algorithm by formulating this task as a consistency-based diagnosis problem that uses the social media/crowdsourced reporting data as possibly faulty sensors and further, develops a model to identify the likelihood of emergency incident events in a given region of the city. These likelihoods are then used to estimate the plausibility of the hypotheses generated by the diagnosis problem.
c.) Develop an action recommendation tool to support resource dispatch and stationing for emergency services based on the results from sub-goals 1 and 2. For this subgoal we aim to work with our local emergency response agencies to create open source software that can integrate existing data sources and provide recommendations on identifying critical resource requirements in the cities. For scoping, we focus on accident event detection and dispatch requirements of ambulances and rescue trucks as resources.

Thesis and Dissertations


  • Jitin Krishnan. Unsupervised Cross-Domain and Cross-Lingual Methods for Text Classification, Slot Filling, & Question-Answering. George Mason University, Fairfax, USA, 2021.


Selected Publications


  • Senarath, Yasas and Mukhopadhyay, Ayan and Vazirizade, Sayyed Mohsen and Purohit, Hemant and Nannapaneni, Saideep and Dubey, Abhishek. (2021). Practitioner-Centric Approach for Early Incident Detection Using Crowdsourced Data for Emergency Services. 2021 IEEE International Conference on Data Mining (ICDM). 1318-1323.

  • Pandey, Rahul and Purohit, Hemant and Castillo, Carlos and Shalin, Valerie L.. (2022). Modeling and mitigating human annotation errors to design efficient stream processing systems with human-in-the-loop machine learning. International Journal of Human-Computer Studies. 160 (C) 102772.

  • Krishnan, Jitin and Anastasopoulos, Antonios and Purohit, Hemant and Rangwala, Huzefa. (2021). Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling. 1st Workshop on Multilingual Representation Learning at ACL-2021. 211-223.

  • Senarath, Yasas and Nannapaneni, Saideep and Purohit, Hemant and Dubey, Abhishek. (2020). Emergency Incident Detection from Crowdsourced Waze Data using Bayesian Information Fusion. 2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT). 187-194.

  • Purohit, Hemant and Shalin, Valerie L. and Sheth, Amit P. and Sheth, Amit. (2020). Knowledge Graphs to Empower Humanity-Inspired AI Systems. IEEE Internet Computing. 24 (4), 48-54.

  • Krishnan, Jitin and Purohit, Hemant and Rangwala, Huzefa. "Diversity-Based Generalization for Neural Unsupervised Text Classification under Domain Shift," ECML-PKDD, 2020.

  • Purohit, Hemant and Peterson, Steve. (2020). Social Media Mining for Disaster Management and Community Resilience. Big Data in Emergency Management: Exploitation Techniques for Social and Mobile Data, Ed: Akerkar, R. Springer.

  • Purohit, Hemant and Castillo, Carlos and Pandey, Rahul. "Ranking and grouping social media requests for emergency services using serviceability model," Social Network Analysis and Mining, v.10, 2020. doi:10.1007/s13278-020-0633-3

  • Mukhopadhyay, Ayan and Pettet, Geoffrey and Vazirizade, Sayyed and Vorobeychik, Yevgeniy and Kochenderfer, Mykel and Dubey, Abhishek. A Review of Emergency Incident Prediction, Resource Allocation and Dispatch Models. Accident Analysis & Prevention, 2022.

  • Purohit, Hemant and Kanagasabai, Rajaraman and Deshpande, Nikhil. "Towards Next Generation Knowledge Graphs for Disaster Management," The 13th IEEE International Conference on Semantic Computing (ICSC), 2019. doi:10.1109/ICOSC.2019.8665638

  • Pandey, Rahul and Castillo, Carlos and Purohit, Hemant. "Modeling Human Annotation Errors to Design Bias-Aware Systems for Social Stream Processing," The 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). 2019.

  • Chan, Jennifer L. and Purohit, Hemant. "Challenges to Transforming Unconventional Social Media Data into Actionable Knowledge for Public Health Systems During Disasters," Disaster Medicine and Public Health Preparedness, 2019.

  • Purohit, Hemant and Castillo, Carlos and Imran, Muhammad and Pandey, Rahul. "Ranking of Social Media Alerts with Workload Bounds in Emergency Operation Centers," The 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI), 2018. doi:10.1109/WI.2018.00-88

  • Pandey, Rahul and Bahl, Gaurav and Purohit, Hemant. "EMAssistant: A Learning Analytics System for Social and Web Data Filtering to Assist Trainees and Volunteers of Emergency Services," The 16th International Conference on Information Systems for Crisis Response And Management (ISCRAM), 2019.

  • Purohit, Hemant and Dubrow, Samantha and Bannan, Brenda. "Designing a Multimodal Analytics System to Improve Emergency Response Training," The 21st International Conference on Human-Computer Interaction, 2019.

  • Purohit, Hemant and Nannapaneni, Saideep and Dubey, Abhishek and Karuna, Prakruthi and Biswas, Gautam. "Structured Summarization of Social Web for Smart Emergency Services by Uncertain Concept Graph," 2018 IEEE International Science of Smart City Operations and Platforms Engineering in Partnership with Global City Teams Challenge (SCOPE-GCTC), 2018. doi:10.1109/SCOPE-GCTC.2018.00012

  • Samal, Chinmaya and Dubey, Abhishek and Ratliff, Lillian. "Mobilytics-Gym: A simulation framework for analyzing urban mobility decision strategies," The 2019 IEEE International Conference on Smart Computing (SMARTCOMP), 2019. doi:10.1109/SMARTCOMP.2019.00064

  • Mukhopadhyay, Ayan and Pettet, Geoffrey and Samal, Chinmaya and Dubey, Abhishek and Vorobeychik, Yevgeniy. "An online decision-theoretic pipeline for responder dispatch," The 10th ACM/IEEE International Conference on Cyber-Physical Systems, 2019. doi:10.1145/3302509.3311055

People


Faculty:
- Dr. Hemant Purohit, GMU
- Dr. Huzefa Rangwala, GMU
- Dr. Abhishek Dubey, VU
- Dr. Gautam Biswas, VU

Students:
- Jitin Krishnan (PhD research assistant, GMU; Status: graduated and currently at Facebook Research)
- Rahul Pandey (PhD research assistant, GMU; Status: current)
- Yasas Senarath (PhD research assistant, GMU; Status: current)
- Prashanti Maktala (MS research assistant, GMU; Status: graduated)

Contact


If interested in this project and want to pursue PhD or MS Thesis, then you can mail at h p u r o h i t _a_t_ g m u _d_o_t_ e d u with your resume, transcripts, and any prior research papers.