{"id":561,"date":"2020-08-21T14:49:21","date_gmt":"2020-08-21T14:49:21","guid":{"rendered":"https:\/\/data-science.gotoauthority.com\/2020\/08\/21\/must-read-nlp-and-deep-learning-articles-for-data-scientists\/"},"modified":"2020-08-21T14:49:21","modified_gmt":"2020-08-21T14:49:21","slug":"must-read-nlp-and-deep-learning-articles-for-data-scientists","status":"publish","type":"post","link":"https:\/\/wealthrevelation.com\/data-science\/2020\/08\/21\/must-read-nlp-and-deep-learning-articles-for-data-scientists\/","title":{"rendered":"Must-read NLP and Deep Learning articles for Data Scientists"},"content":{"rendered":"<div id=\"post-\">\n<p><img class=\"aligncenter size-full wp-image-115069\" src=\"https:\/\/www.kdnuggets.com\/wp-content\/uploads\/Fig1-Ambalina-must-read-nlp-deep-learning-articles.jpg\" alt=\"\" width=\"90%\"><\/p>\n<p>As always, the fields of deep learning and natural language processing are as busy as ever. Despite many industries being hindered by the quarantine restrictions in many countries, the machine learning industry continues to move forward. It seems almost every week, new models are being released, and new startups are showing off AI-powered technologies that will help build a better world. In this article, we will briefly go over some of the biggest recent news in NLP and deep learning, as well as some must-read guides, feature articles, tools, resources, and datasets you may want to check out.<\/p>\n<p>\u00a0<\/p>\n<h3>NLP &amp; Deep Learning News<\/h3>\n<p>\u00a0<\/p>\n<ol>\n<li><strong><a href=\"https:\/\/towardsdatascience.com\/how-deep-learning-can-keep-you-safe-with-real-time-crime-alerts-95778aca5e8a\" target=\"_blank\" rel=\"noopener noreferrer\">How Deep Learning Can Keep You Safe with Real-Time Crime Alerts<\/a><\/strong><\/li>\n<\/ol>\n<p>From Nikunj Aggarwal, the Machine Learning Lead at Citizen, this article gives us a great example of how deep learning is being used to create life-changing (or life-saving) technologies. Citizen is an emergency and safety alert app that warns people of incidents and crimes that have taken place in their area in real-time.<\/p>\n<p><img class=\"aligncenter size-large\" src=\"https:\/\/miro.medium.com\/max\/668\/0*OVZb-cjvoncr9tTg\" width=\"90%\"><\/p>\n<p><em>Image from <a href=\"http:\/\/citizen.com\/mission\" target=\"_blank\" rel=\"noopener noreferrer\">Citizen<\/a>.<\/em><\/p>\n<p>The company used a speech-to-text engine and a convolutional neural network to analyze first responder radio frequencies. In doing so, the company was able to scale their app to multiple cities in the United States. This technology could mark a huge change in the police and first responder infrastructure in years to come.<\/p>\n<ol start=\"2\">\n<li><strong><a href=\"https:\/\/openai.com\/blog\/openai-api\/\" target=\"_blank\" rel=\"noopener noreferrer\">The Release of Open AI API<\/a><\/strong><\/li>\n<\/ol>\n<p>The release of GPT-3 by Open AI was likely the biggest news in the field of NLP this year. However, what many people may have missed is the release of Open AI\u2019s API. The purpose of the API is to give people access to future models developed by the company, including GPT-3. This is big news, as it marks a shift for the company\u2019s normal practices of open-sourcing their models (as they did with GPT-2). In the article, the company explains why they decided to release a commercial product, why they went away from open-source this time around, and how they will control potential misuse of their API.<\/p>\n<ol start=\"3\">\n<li><strong><a href=\"https:\/\/www.theverge.com\/2020\/6\/8\/21284683\/ibm-no-longer-general-purpose-facial-recognition-analysis-software\" target=\"_blank\" rel=\"noopener noreferrer\">IBM will no longer offer, develop, or research facial recognition technology<\/a><\/strong><\/li>\n<\/ol>\n<p>In <a href=\"https:\/\/www.ibm.com\/blogs\/policy\/facial-recognition-susset-racial-justice-reforms\/\" target=\"_blank\" rel=\"noopener noreferrer\">a letter to congress<\/a>, the CEO of IBM publicly stated that the company would be halting development and service offerings of general-purpose facial recognition technology.<\/p>\n<p><img class=\"aligncenter size-full wp-image-115070\" src=\"https:\/\/www.kdnuggets.com\/wp-content\/uploads\/Fig2-Ambalina-must-read-nlp-deep-learning-articles.jpg\" alt=\"\" width=\"90%\"><\/p>\n<p>This was a huge step for the company and a big message to the data science community as a whole. IBM\u2019s move to prioritize ethics and safety might have encouraged other large tech companies (including <a href=\"https:\/\/www.washingtonpost.com\/technology\/2020\/06\/11\/microsoft-facial-recognition\/\" target=\"_blank\" rel=\"noopener noreferrer\">Microsoft<\/a>) to do the same.<\/p>\n<ol start=\"4\">\n<li><strong><a href=\"https:\/\/ai.googleblog.com\/2020\/07\/introducing-model-card-toolkit-for.html\" target=\"_blank\" rel=\"noopener noreferrer\">Introducing the Model Card Toolkit for Easier Model Transparency Reporting<\/a><\/strong><\/li>\n<\/ol>\n<p>With the creation of larger and possibly more complicated deep learning models, it becomes increasingly difficult to explain their intended use cases and other information to users downstream. To help solve this problem, researchers at Google have developed the \u201cModel Card Toolkit\u201d to help make model transparency reports easier to create.<\/p>\n<p>\u00a0<\/p>\n<h3>Bonus Machine Learning News<\/h3>\n<p>\u00a0<\/p>\n<ol start=\"5\">\n<li><strong><a href=\"https:\/\/medium.com\/discourse\/you-dont-need-college-anymore-says-google-102d4beec668\" target=\"_blank\" rel=\"noopener noreferrer\">You Don\u2019t Need College Anymore, Says Google<\/a><\/strong><\/li>\n<\/ol>\n<p>Do you need a Ph.D. to work in data science? Well, Google\u2019s new certification program may change the game. On July 14th, 2020, Google announced their new professional certification programs in the fields of UX design, project management, and <strong>data analysis.\u00a0<\/strong><\/p>\n<p>Whether or not a Google Certificate in data analysis will be enough to land you a job at a data science team is yet to be determined. However, a certification from the largest tech company in the world may end up being worth more than a 4-year degree.<\/p>\n<ol start=\"6\">\n<li><strong><a href=\"https:\/\/curia.europa.eu\/jcms\/upload\/docs\/application\/pdf\/2020-07\/cp200091en.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">The Court of Justice invalidates Decision 2016\/1250 on the adequacy of the protection provided by the EU-US Data Protection Shield<\/a><\/strong><\/li>\n<\/ol>\n<p>In July of 2020, a big decision was made by the Court of Justice of the European Union that may greatly affect data transfer between Europe and the United States. Essentially, the decision was made to invalidate \u201cDecision 2016\/1250\u201d, which fostered in a data transfer agreement titled the \u201cEU-U.S. Privacy Shield\u201d.<\/p>\n<p>Instead, those whose data are transferred to a country not in the EU, must be afforded \u201ca level of protection essentially equivalent to that guaranteed within the EU by the GDPR.\u201d This means that if a company like Tik Tok wants to transfer data from users in the EU to be processed on servers in the United States, authorities have the responsibility to prohibit this data transfer if they deem that data privacy and security measures in the United States don\u2019t comply with GDPR standards.<\/p>\n<p>If you want to learn more about this, <a href=\"https:\/\/techcrunch.com\/2020\/07\/16\/europes-top-court-strikes-down-flagship-eu-us-data-transfer-mechanism\/\" target=\"_blank\" rel=\"noopener noreferrer\">TechCrunch\u2019s article<\/a> is a much easier read than the actual legal document.<\/p>\n<p>\u00a0<\/p>\n<h3>Deep Learning Guides &amp; Feature Articles<\/h3>\n<p>\u00a0<\/p>\n<ol start=\"7\">\n<li><strong><a href=\"https:\/\/towardsdatascience.com\/deep-learning-algorithms-the-complete-guide-e4e7c535b2fc\">Deep Learning Algorithms \u2014 The Complete Guide<\/a><\/strong><\/li>\n<\/ol>\n<p>From Sergios Karagiannakos, the founder of AI Summer, this article serves as a meaty guide to deep learning. It introduces many topics, from the different kinds of neural networks to deep learning baselines in NLP and computer vision.<\/p>\n<ol start=\"8\">\n<li><strong><a href=\"https:\/\/lambdalabs.com\/blog\/demystifying-gpt-3\/\" target=\"_blank\" rel=\"noopener noreferrer\">OpenAI&#8217;s GPT-3 Language Model: A Technical Overview<\/a><\/strong><\/li>\n<\/ol>\n<p>As mentioned previously, Open AI\u2019s launch of GPT-3 was likely the biggest news in NLP so far this year.<\/p>\n<p><img class=\"aligncenter size-full wp-image-115071\" src=\"https:\/\/www.kdnuggets.com\/wp-content\/uploads\/Fig3-Ambalina-must-read-nlp-deep-learning-articles.jpg\" alt=\"\" width=\"90%\"><\/p>\n<p>For those of you that don\u2019t know, GPT-3 is a text-generating neural network that has 175 billion parameters, which is incredibly larger than the previous model, GPT-2 (1.5 billion parameters). This guide serves as a great overview of the model, with key takeaways and explanations about the model and data used to train it.<\/p>\n<ol start=\"9\">\n<li><strong><a href=\"http:\/\/dailynous.com\/2020\/07\/30\/philosophers-gpt-3\/\" target=\"_blank\" rel=\"noopener noreferrer\">Philosophers On GPT-3 (updated with replies by GPT-3)<\/a><\/strong><\/li>\n<\/ol>\n<p>From Daily Nous, this is an interesting thought piece where 9 philosophers take a deep dive into Open AI\u2019s GPT-3. These thought leaders explore the possible ethical and moral issues, as well as the lingering questions brought forth by the technology.<\/p>\n<ol start=\"10\">\n<li><strong><a href=\"https:\/\/lionbridge.ai\/articles\/end-to-end-multiclass-image-classification-using-pytorch-and-transfer-learning\/\" target=\"_blank\" rel=\"noopener noreferrer\">End to End Multiclass Image Classification Using Pytorch and Transfer Learning<\/a><\/strong><\/li>\n<\/ol>\n<p>From Rahul Agarwal, a data scientist at WalmartLabs, this guide is a step-by-step tutorial on creating a multiclass image classification model. Furthermore, Agarwal explains what transfer learning is and how to use it to improve your own image classification models.<\/p>\n<ol start=\"11\">\n<li><strong><a href=\"https:\/\/onezero.medium.com\/the-100-year-history-of-self-driving-vehicles-10b8546a3318\" target=\"_blank\" rel=\"noopener noreferrer\">The 100-Year History of Self-Driving Cars<\/a><\/strong><\/li>\n<\/ol>\n<p>This feature piece from OneZero talks about the long history of autonomous vehicles, from the first manual auto-pilot maneuvers on ships to the self-driving cars we see from the likes of Tesla and Google today.<\/p>\n<p>\u00a0<\/p>\n<h3>Additional Tools &amp; Resources<\/h3>\n<p>\u00a0<\/p>\n<ol start=\"12\">\n<li><strong><a href=\"https:\/\/www.kdnuggets.com\/2020\/07\/5-fantastic-nlp-books.html\" target=\"_blank\" rel=\"noopener noreferrer\">5 Fantastic Natural Language Processing Books<\/a><\/strong><\/li>\n<\/ol>\n<p><img class=\"aligncenter size-full wp-image-115072\" src=\"https:\/\/www.kdnuggets.com\/wp-content\/uploads\/Fig4-Ambalina-must-read-nlp-deep-learning-articles.jpg\" alt=\"\" width=\"90%\"><\/p>\n<p>Written by KDnuggets Editor Matthew Mayo, this useful guide introduces five books on NLP from his personal library. Unlike other book lists you may find online, Matthew has personally read all of these books and vouches for their quality. Please note that these books are not free, so they require a bit of investment on your part.<\/p>\n<ol start=\"13\">\n<li><strong><a href=\"https:\/\/www.kdnuggets.com\/2020\/06\/dont-click-this-how-spot-deepfakes.html\" target=\"_blank\" rel=\"noopener noreferrer\">Tools to Spot Deepfakes and AI-Generated Text<\/a><\/strong><\/li>\n<\/ol>\n<p>With the rampant spread of misinformation on social media, I was very concerned when I saw this spread reach my own inner circles. As it has become easier and easier to create deepfakes and generate fake articles using AI, I wanted to help combat the malicious use of these technologies. This article introduces a few simple methods and browser plugins that may help you detect both deepfakes and AI-generated text.<\/p>\n<ol start=\"14\">\n<li><strong><a href=\"https:\/\/lionbridge.ai\/datasets\/tensorflow-datasets-machine-learning\/\" target=\"_blank\" rel=\"noopener noreferrer\">30 Largest TensorFlow Datasets for Machine Learning<\/a><\/strong><\/li>\n<\/ol>\n<p>This listicle is a simple curation of the largest datasets in the TensorFlow library that may prove useful in improving your deep learning models. It introduces the largest audio, video, image, and text datasets on the platform and some of their intended use cases.<\/p>\n<ol start=\"15\">\n<li><strong><a href=\"https:\/\/www.toptal.com\/developers\/machine-learning\/hourly-rate\" target=\"_blank\" rel=\"noopener noreferrer\">Machine Learning Developer Hourly Rate Calculator<\/a><\/strong><\/li>\n<\/ol>\n<p>From Toptal, this handy tool can help you determine the average hourly rate for data scientists based on your location, programming languages, and skills. You can use this calculator to compare the average salary for your position in your own country and other countries to help you evaluate your career and plan your next steps.<\/p>\n<p>We hope that these NLP and deep learning articles and guides helped you catch up with some of the big things happening in machine learning this year. For more reading, please take a look at the top stories below.<\/p>\n<p><b>Related:<\/b><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/www.kdnuggets.com\/2020\/08\/must-read-nlp-deep-learning-articles.html<\/p>\n","protected":false},"author":0,"featured_media":562,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[2],"tags":[],"_links":{"self":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/posts\/561"}],"collection":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/comments?post=561"}],"version-history":[{"count":0,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/posts\/561\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/media\/562"}],"wp:attachment":[{"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/media?parent=561"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/categories?post=561"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wealthrevelation.com\/data-science\/wp-json\/wp\/v2\/tags?post=561"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}