Meetings


Goal

target:
    1000 unique workplace meeting utterances

scope:
    opening
    agenda
    clarification
    interruption
    agreement / disagreement
    decision
    action item
    closing
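
A minimal sketch of how progress toward the target could be tracked per scope item. The tag slugs (including splitting "agreement / disagreement" into two tags) and the `progress` helper are assumptions for illustration; the notes do not fix a spelling or a tool.

```python
from collections import Counter

# Assumed slugs for the scope list; "agreement / disagreement" is split in two here.
MEETING_TAGS = (
    "opening", "agenda", "clarification", "interruption",
    "agreement", "disagreement", "decision", "action_item", "closing",
)
TARGET_UNIQUE = 1000  # target number of unique utterances from the goal


def progress(rows):
    """rows: iterable of (utterance, tag) pairs.

    Returns (unique_count, remaining_to_target, per_tag_counts).
    """
    seen = {}
    for utterance, tag in rows:
        if tag not in MEETING_TAGS:
            raise ValueError(f"unknown tag: {tag!r}")
        # Exact-duplicate text is counted once; the first tag seen wins.
        seen.setdefault(utterance, tag)
    per_tag = Counter(seen.values())
    remaining = max(TARGET_UNIQUE - len(seen), 0)
    return len(seen), remaining, per_tag
```

Calling `progress` on the collected (utterance, tag) pairs shows which meeting functions are still underrepresented before the target is reached.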

Source Plan

Source                                                    Use
https://groups.inf.ed.ac.uk/ami/icsi/                     meeting corpus / transcripts
https://catalog.ldc.upenn.edu/LDC2004T04                  ICSI meeting transcript metadata
https://www.federalreserve.gov/monetarypolicy/fomc_historical.htm  public meeting transcripts
https://www.sec.gov/newsroom/speeches-statements          public hearing / meeting transcripts
https://www.nato.int/en/news-and-events/transcripts       public meeting remarks

Collection Rules

do:
    collect short utterances only
    deduplicate exact text
    keep source_url for every utterance
    keep speaker/context when available
    tag by meeting function

do not:
    paste long transcript blocks
    import closed-license corpus text without permission
    mix generated sentences into this file
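
The rules above could be enforced with a small filter like the following sketch. The 30-word cutoff, the dict keys, and the function name `apply_collection_rules` are assumptions; the notes only say "short" and do not name a tool.

```python
MAX_WORDS = 30  # assumed cutoff; the rules only say "short utterances"


def apply_collection_rules(candidates, max_words=MAX_WORDS):
    """Filter candidate dicts that carry 'utterance' and 'source_url' keys.

    Returns (kept, rejected), where rejected holds (candidate, reason) pairs.
    """
    kept, rejected, seen = [], [], set()
    for cand in candidates:
        text = cand.get("utterance", "")
        if not text.strip():
            rejected.append((cand, "empty utterance"))
        elif not cand.get("source_url"):
            # Every utterance must keep its source_url.
            rejected.append((cand, "missing source_url"))
        elif len(text.split()) > max_words:
            # Long transcript blocks are not allowed.
            rejected.append((cand, "too long"))
        elif text in seen:
            # Deduplicate on exact text only, as the rules state.
            rejected.append((cand, "exact duplicate"))
        else:
            seen.add(text)
            kept.append(cand)
    return kept, rejected
```

Keeping the rejection reason next to each dropped candidate makes it easy to audit why an utterance did not enter the file. License checks and the ban on generated sentences are not automated here and would still need manual review.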

Corpus Schema

columns:
    id
    utterance
    tag
    source_url
    source_license
    note
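
One possible way to represent the schema in code is a record type plus a tab-separated writer. This is a sketch under assumptions: the class name `UtteranceRow`, the string types, the optional `note`, and the TSV format are not specified in the notes, and the sample values in the usage note are hypothetical.

```python
import csv
import io
from dataclasses import astuple, dataclass, fields


@dataclass(frozen=True)
class UtteranceRow:
    # Field names and order follow the schema columns; types are assumed.
    id: str
    utterance: str
    tag: str
    source_url: str
    source_license: str
    note: str = ""


def to_tsv(rows):
    """Serialize rows to tab-separated text with a header line."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    writer.writerow([f.name for f in fields(UtteranceRow)])
    for row in rows:
        writer.writerow(list(astuple(row)))
    return buf.getvalue()
```

For example, `to_tsv([UtteranceRow("u0001", "Shall we begin?", "opening", "https://example.org/t1", "public-domain")])` yields a header line followed by one data line, with an empty `note` column.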