The workshop agenda just below is divided into two parts. In the morning, we will review language resources needs from the perspective of researchers and funding agencies, summarize current language resource efforts and models, and examine two complementary models in some detail (LDC and CSLU). We will discuss these topics, and review and/or revise the topics for afternoon breakout groups.
After lunch, the workshop turns into a WORKshop. The first set of breakout groups will consider what language resources are needed to advance human language technology. The second set of breakout groups will consider models for funding, developing and distributing these resources. The participants will then present their recommendations, outline a report that incorporates them, and assign responsibility for authoring the sections. It is expected that this report will have a major impact on the formulation of new policies and initiatives.
|6:30 AM –
|8:00 AM||Welcome||Gary Strong|
|8:10 AM||Welcome and overview of workshop agenda||Ron Cole|
|8:20 AM||Language resources for NSF||Gary Strong|
|8:40 AM||Language resources for DARPA||Alan Sears|
|9:00 AM||Language resources for DoD||Lynn Carlson|
|9:40 AM||LDC and language resources||Mark Liberman|
|10:30||CSLU’s model for language resources||Ron Cole|
|11:20 AM||Resources for word sense identification||George Miller|
|11:40 AM||“The Discourse Initiative”||Suzann Luperfoy|
|12:00 Noon||Discussion and tasking for first breakouts.||–|
|12:30 PM||Working lunch and first breakouts. Visit lunch buffet and take lunch to your breakout session. Return with maximum two foils for plenary briefing||–|
|2:15 PM||First breakout reports||–|
|2:45 PM||Select topics for second breakouts||–|
|3:30 PM||Second breakouts: what models?||–|
|5:00 PM||Second breakout reports: 5 minutes each||Mark Liberman|
|5:45 PM||Workshop Report
(1) Generate outline of final report
(2) Assign authors to sections and subsections
(3) Establish timelines, etc.
|6:45 PM||Reception and dinner||–|
The overarching question to be addressed by the first set of breakouts is: What language resources are needed to support our National agenda? This question should be addressed in the broadest sense– language resources include (annotated) written and spoken corpora, static images, videos of people conversing, creation of standards and evaluation methodologies, as well as tools for creating, learning about and developing language resources and technologies.
The following proposed breakout topics are grouped in terms of a general model of information retrieval– requesting information; locating information; organizing information; and presenting information. Within each group, we ask participants to address the issues of (a) data resources (e.g., annotated corpora, lexicons, images, videos); (b) tools and technologies; (c) standards and evaluation metrics.
METHODS AND MODELS
The overarching issue of the second set of breakouts is: How do we fund, develop and distribute language resources to all who need them? One possible set of topics is to divide the world into
Additional topics might include: