User Tools

Site Tools


wiki:welcome

Big Crisis Data - Resources

Datasets

Corpora of social media messages for crises/disasters research.

Software for natural language processing (NLP)

Programs and libraries for tokenization, part-of-speech tagging, entity extraction, entity linking, and other NLP operations.

In Java:

  • WEKA: open-source data mining software in Java.
  • MALLET: natural language processing and topic modeling.
  • Apache OpenNLP: natural language processing.
  • GATE: text processing.
  • ArkNLP: Twitter-specific natural language processing.

In Python:

  • NLTK: natural language toolkit.

Language-agnostic:

Online:

Software for geographical information systems

Geotagging:

Free maps:

  • OpenStreetMap: geographical information, useful for building gazeteers.

Software for crowdsourcing

Free/open systems

Integrated systems that are open and free.

Venues

Past and present related conferences and workshops:

  • SWDM'16: Social Web for Disaster Management
  • ISCRAM'16: Information Systems for Crisis Response and Management
  • SAFE'15: Workshop on Semantics and Analytics for Emergency Response
  • KDD-LESI'14: Workshop on Learning About Emergencies from Social Information.
  • AAAI Spring Symposium'15: Structured Data for Humanitarian Technologies.
  • HUMTEC'16: Humanitarian Technologies

Volunteer organizations

Blogs and social media

Blogs and social media accounts of researchers and practitioners working in social innovation in general, and/or social media for disasters in particular.

Talks

Talks about big data for emergency/disaster management, and about social media data for research in general.

Errata

  • The Arabic example on page 39, it says “al3ab”, should say “al7ob” (thanks to Sallam Abualhaija).
wiki/welcome.txt · Last modified: 2018/08/25 06:45 by 92.185.54.190