{"id":863,"date":"2020-08-31T14:53:35","date_gmt":"2020-08-31T14:53:35","guid":{"rendered":"https:\/\/data-science.gotoauthority.com\/2020\/08\/31\/linguistic-fundamentals-for-natural-language-processing-100-essentials-from-semantics-and-pragmatics\/"},"modified":"2020-08-31T14:53:35","modified_gmt":"2020-08-31T14:53:35","slug":"linguistic-fundamentals-for-natural-language-processing-100-essentials-from-semantics-and-pragmatics","status":"publish","type":"post","link":"https:\/\/wealthrevelation.com\/data-science\/2020\/08\/31\/linguistic-fundamentals-for-natural-language-processing-100-essentials-from-semantics-and-pragmatics\/","title":{"rendered":"Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Semantics and Pragmatics"},"content":{"rendered":"<div id=\"post-\">\n<p><b>By <a href=\"http:\/\/faculty.washington.edu\/ebender\/\" target=\"_blank\" rel=\"noopener noreferrer\">Emily Bender<\/a>, Professor of Linguistics at the University of Washington<\/b>.<\/p>\n<p><img src=\"https:\/\/www.morganclaypoolpublishers.com\/catalog_Orig\/images\/9781681730738.png%20\" alt=\"https:\/\/www.morganclaypoolpublishers.com\/catalog_Orig\/images\/9781681730738.png \" width=\"250\" align=\"right\">Natural language processing (NLP), including text analytics, text as data, etc., involves the application of machine learning and other methods to text (and speech) in some natural language. For the most part, data scientists working with NLP techniques are interested in the information that is stored in written English (or, more rarely, it seems, other languages). However, to get at this information requires building or at least using algorithms that model the structures of language and their relationship to the meanings expressed.<\/p>\n<p>In 2013, I published <a href=\"http:\/\/www.morganclaypool.com\/doi\/abs\/10.2200\/S00493ED1V01Y201303HLT020\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Linguistic Fundamentals for Natural Language Processing: 100 Essentials from Morphology and Syntax<\/em><\/a>, which was designed to provide an accessible overview of what the field of linguistics can teach NLP about linguistic structure (morphology and syntax). This book was reviewed by <a href=\"https:\/\/link.springer.com\/article\/10.1007\/s10590-014-9149-9\">Francis Tyers in <\/a><a href=\"https:\/\/link.springer.com\/article\/10.1007\/s10590-014-9149-9\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Machine Translation<\/em><\/a> in 2014 and by <a href=\"https:\/\/www.aclweb.org\/anthology\/J15-1007.pdf\">Chris Dyer in <\/a><a href=\"https:\/\/www.aclweb.org\/anthology\/J15-1007.pdf\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Computational Linguistics<\/em><\/a> in 2015.<\/p>\n<p>But structure is only part of the equation. In 2019, I teamed up with <a href=\"http:\/\/homepages.inf.ed.ac.uk\/alex\/\" target=\"_blank\" rel=\"noopener noreferrer\">Alex Lascarides<\/a> to produce a companion volume, <a href=\"http:\/\/www.morganclaypoolpublishers.com\/catalog_Orig\/product_info.php?products_id=1451&amp;fbclid=IwAR1veU0G8gOnTKPEKozqBiMsjEU8QLSK006SasaW457HvyY6HBjsk8IxiY0\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Linguistic Fundamentals for Natural Language Processing II: 100 Essentials from Semantics and Pragmatics<\/em><\/a>, which similarly provides an overview of how meaning is encoded in human languages and how people use those encoded meanings for communicative ends. Both books follow a format of 100 short vignettes, illustrated with specific examples from many different languages, with the goal of making complex ideas approachable.<\/p>\n<p><img class=\"aligncenter\" src=\"\/images\/bender-syntax-tree-700.jpg\" alt=\"Bender Syntax Tree\" width=\"90%\"><\/p>\n<p>The table of contents (made up of the headlines of all 100 vignettes) and the first two chapters (\u201cIntroduction\u201d and \u201cWhat is Meaning?\u201d) can be found <a href=\"https:\/\/www.morganclaypoolpublishers.com\/catalog_Orig\/samples\/9781681730745_sample.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">here<\/a>. Or, if you want it in even shorter form, here are <a href=\"https:\/\/twitter.com\/emilymbender\/status\/1171105042268426243\">tweet-thread serializations of a few of the vignettes<\/a>, including \u201c<a href=\"https:\/\/twitter.com\/emilymbender\/status\/1171105106416164864\" target=\"_blank\" rel=\"noopener noreferrer\">#3: Natural language understanding requires commonsense reasoning<\/a>\u201d, \u201c<a href=\"https:\/\/twitter.com\/emilymbender\/status\/1171800460971274240\" target=\"_blank\" rel=\"noopener noreferrer\">#8 Linguistic meaning includes emotional content<\/a>\u201d, \u201c<a href=\"https:\/\/twitter.com\/emilymbender\/status\/1186372304961277952\" target=\"_blank\" rel=\"noopener noreferrer\">#19: Regular polysemy describes productively related sense families<\/a>\u201d, and \u201c<a href=\"https:\/\/twitter.com\/emilymbender\/status\/1187355978355728384\" target=\"_blank\" rel=\"noopener noreferrer\">#30: Words can have surprising nonce uses through meaning transfer<\/a>\u201d.<\/p>\n<p><img class=\"aligncenter\" src=\"\/images\/bender-sandy-paris-700.jpg\" alt=\"Bender Sandy Paris\" width=\"90%\"><\/p>\n<p>Curious what a nonce use is? #30 is where we cover how it is that sentences like <em>The ham sandwich and salad at Table 7 is getting impatient<\/em> are even ever meaningful. Other vignettes in the book include \u201c#39 Collocations are often less ambiguous than the words taken in isolation\u201d, \u201c#62 Evidentials encode the source a speaker credits the information to and\/or the degree of certainty the speaker feels about it\u201d, and \u201c#95 Silence can be a meaningful act\u201d.<\/p>\n<p><img class=\"aligncenter\" src=\"\/images\/bender-dogs-carried-700.jpg\" alt=\"Bender Dogs Carried\" width=\"90%\"><\/p>\n<p><strong>Bio:<\/strong>\u00a0<a href=\"http:\/\/faculty.washington.edu\/ebender\/\" target=\"_blank\" rel=\"noopener noreferrer\">Emily M. Bender<\/a> (<a href=\"https:\/\/twitter.com\/emilymbender\" target=\"_blank\" rel=\"noopener noreferrer\">@emilymbender<\/a>) is a Professor of Linguistics at the University of Washington and the Faculty Director of the Professional Masters in Computational Linguistics (CLMS) program. Her research interests include the interaction of linguistics and NLP, computational semantics, multilingual NLP, and the societal impact of language technology.<\/p>\n<p><b>Related:<\/b><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/www.kdnuggets.com\/2020\/08\/linguistic-fundamentals-natural-language-processing.html<\/p>\n","protected":false},"author":0,"featured_media":864,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[2],"tags":[],"_links":{"self":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/posts\/863"}],"collection":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/comments?post=863"}],"version-history":[{"count":0,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/posts\/863\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/media\/864"}],"wp:attachment":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/media?parent=863"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/categories?post=863"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/tags?post=863"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}