{"id":3907,"date":"2020-09-26T06:36:20","date_gmt":"2020-09-26T06:36:20","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2020\/09\/26\/gpt3-and-agi-beyond-the-dichotomy-part-two\/"},"modified":"2020-09-26T06:36:20","modified_gmt":"2020-09-26T06:36:20","slug":"gpt3-and-agi-beyond-the-dichotomy-part-two","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2020\/09\/26\/gpt3-and-agi-beyond-the-dichotomy-part-two\/","title":{"rendered":"GPT3 and AGI: Beyond the dichotomy \u2013 part two"},"content":{"rendered":"<p>Author: ajit jaokar<\/p>\n<div>\n<p><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/7970058294?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/7970058294?profile=RESIZE_710x\" class=\"align-full\"><\/a><\/p>\n<p>This blog continues from <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/gpt3-and-agi-beyond-the-dichotomy-part-one\" target=\"_blank\" rel=\"noopener noreferrer\">GPT3 and AGI: Beyond the dichotomy &ndash; part one<\/a><\/p>\n<p><strong>GPT3 and AGI<\/strong><\/p>\n<p>Let&rsquo;s first clarify what AGI should look like<\/p>\n<p>Consider the movie &lsquo;Terminator&rsquo;<\/p>\n<p>When the Arnold Schwarzenegger character comes to earth &ndash; he is fully functional. To do so, he must be aware of the context. In other words, AGI should be able to operate in <strong>any<\/strong> context<\/p>\n<p>Such an entity does not exist<\/p>\n<p>Nor is GPT3 such an entity<\/p>\n<p>However, GPT3 can respond in an &lsquo;AGI-like&rsquo; way to a much wider set of contexts than traditional AI systems.<\/p>\n<p>GPT3 has many things going for it<\/p>\n<ul>\n<li>Unsupervised learning is the future<\/li>\n<li>Linguistic capabilities distinguish humans<\/li>\n<li>But language is much more than encoding information. 
At a social level, language involves&nbsp;joint attention&nbsp;to environment, expectations and patterns.<\/li>\n<li>Attention serves as a foundation for social trust<\/li>\n<li>Hence, AGI needs a linguistic basis &ndash; but that needs attention and attention needs context. So, GPT-3 &ndash; linguistic &ndash; attention &ndash; context could lead to AGI-like behaviour<\/li>\n<\/ul>\n<p><span style=\"text-decoration: underline;\"><strong>Does AGI need to be conscious as we know it or would access consciousness suffice?<\/strong><\/span><\/p>\n<p>In this context, a recent paper<\/p>\n<p>A Roadmap for Artificial General Intelligence: Intelligence, Knowledge, and Consciousness: Garrett Mindt and Carlos Montemayor<\/p>\n<p><a href=\"https:\/\/www.academia.edu\/43620181\/A_Roadmap_for_Artificial_General_Intelligence_Intelligence_Knowledge_and_Consciousness\">https:\/\/www.academia.edu\/43620181\/A_Roadmap_for_Artificial_General_Intelligence_Intelligence_Knowledge_and_Consciousness<\/a> argues that<\/p>\n<ul>\n<li>integrated information in the form of attention suffices for AGI<\/li>\n<li>AGI must be understood in terms of epistemic agency (epistemic = relating to knowledge or the study of knowledge), and<\/li>\n<li>epistemic agency necessitates access consciousness.<\/li>\n<li>access consciousness: acquiring knowledge for action, decision-making, and thought, without necessarily being conscious<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>Therefore, the proposal goes that AGI necessitates&nbsp;<\/p>\n<ul>\n<li>selective attention for accessing information relevant to action,&nbsp; decision-making,&nbsp; memory&nbsp; and&nbsp; &nbsp;&nbsp;<\/li>\n<li>But not necessarily consciousness as we know it<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>This line of thinking leads to many questions<\/p>\n<ul>\n<li>Is consciousness necessary for AGI?<\/li>\n<li>If so, should that consciousness be the same as human consciousness?<\/li>\n<li>Intelligence is 
typically understood in terms of problem-solving. Problem solving by definition leads to a specialized mode of evaluation. Tests of this kind are easy to formulate but check for compartmentalized competency (which cannot be called intelligence). They also do not allow intelligence to &lsquo;spill over&rsquo; from one domain to another &ndash; as it does in human intelligence.&nbsp;<\/li>\n<li>Intelligence needs information to be processed in a contextually relevant way.<\/li>\n<li>Can we use epistemic agency through attention as the distinctive mark of general intelligence even without consciousness? (as per Garrett Mindt and Carlos Montemayor)<\/li>\n<li>In this model, AGI is based on joint attention to preferences in a context-sensitive way.<\/li>\n<li>Would AI be a peer or subservient in the joint attention model?<\/li>\n<\/ul>\n<p>Finally, let us consider the question of spillover of intelligence. In my view, that is another characteristic of AGI. It&rsquo;s not easy to quantify because tests are currently specific to problem types. A recent example of spillover of intelligence is <strong><em>Facebook AI supposedly inventing its own secret language.<\/em><\/strong> The media would have you believe that groups of AGI are secretly plotting to take over humanity. But the reality is more mundane, as explained in <a href=\"https:\/\/towardsdatascience.com\/the-truth-behind-facebook-ai-inventing-a-new-language-37c5d680e5a7\">The truth behind facebook AI inventing a new language<\/a><\/p>\n<p>In a nutshell, the system was using reinforcement learning. Facebook was trying to create a bot that could negotiate. To do this, Facebook let two instances of the bot negotiate with each other &ndash; and learn from each other. The only measure of their success was how well they transacted objects. The only rule to follow was to put words on the screen. 
As long as they were optimizing the goal (negotiating) and understood each other, it did not matter whether the language was accurate (or indeed was English). Hence, the news about &lsquo;inventing a new language&rsquo;. <strong><em>But to me, the real question is: does it represent intelligence spillover?<\/em><\/strong><\/p>\n<p>Much of future AI could be in that direction.<\/p>\n<p><strong><u>To Conclude<\/u><\/strong><\/p>\n<p>We are left with some key questions: <strong>&nbsp;<\/strong><\/p>\n<ul>\n<li>Does AGI need consciousness or access consciousness?<\/li>\n<li>What is the role of language in intelligence?<\/li>\n<li>GPT3 has reopened the discussion, but hype and dichotomy persist (neither helps, because hype misdirects discussion and dichotomy shuts it down)<\/li>\n<li>Does the &lsquo;Bitter lesson&rsquo; apply? If so, what are its implications?<\/li>\n<li>Will AGI see a take-off point like Google Translate did?<\/li>\n<li>What is the future of bias reduction other than what we see today?<\/li>\n<li>Can bias reduction improve human insight and hence improve joint attention?<\/li>\n<li>GPT-3 &ndash; linguistic &ndash; attention &ndash; context<\/li>\n<li>If context is the key, what other ways are there to include context?<\/li>\n<li>Does problem solving compartmentalize intelligence?<\/li>\n<li>Are we comfortable with the &lsquo;spillover&rsquo; of intelligence in AI? 
&ndash; like in the facebook experiment<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p><strong><u>References<\/u><\/strong><\/p>\n<p><a href=\"https:\/\/towardsdatascience.com\/gpt-3-the-first-artificial-general-intelligence-b8d9b38557a1\">https:\/\/towardsdatascience.com\/gpt-3-the-first-artificial-general-intelligence-b8d9b38557a1<\/a><\/p>\n<p><a href=\"https:\/\/www.gwern.net\/GPT-3\">https:\/\/www.gwern.net\/GPT-3<\/a><\/p>\n<p><a href=\"http:\/\/haggstrom.blogspot.com\/2020\/06\/is-gpt-3-one-more-step-towards.html\">http:\/\/haggstrom.blogspot.com\/2020\/06\/is-gpt-3-one-more-step-towards.html<\/a><\/p>\n<p><a href=\"https:\/\/nordicapis.com\/on-gpt-3-openai-and-apis\/\">https:\/\/nordicapis.com\/on-gpt-3-openai-and-apis\/<\/a><\/p>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/what-is-s-driving-the-innovation-in-nlp-and-gpt-3\">https:\/\/www.datasciencecentral.com\/profiles\/blogs\/what-is-s-driving-the-innovation-in-nlp-and-gpt-3<\/a><\/p>\n<p><a href=\"https:\/\/bdtechtalks.com\/2020\/08\/17\/openai-gpt-3-commercial-ai\/\">https:\/\/bdtechtalks.com\/2020\/08\/17\/openai-gpt-3-commercial-ai\/<\/a><\/p>\n<p><a href=\"https:\/\/aidevelopmenthub.com\/joscha-bach-on-gpt-3-achieving-agi-machine-understanding-and-lots-more-artificial\/\">https:\/\/aidevelopmenthub.com\/joscha-bach-on-gpt-3-achieving-agi-machine-understanding-and-lots-more-artificial\/<\/a><\/p>\n<p><a href=\"https:\/\/medium.com\/@ztalib\/gpt-3-and-the-future-of-agi-8cef8dc1e0a1\">https:\/\/medium.com\/@ztalib\/gpt-3-and-the-future-of-agi-8cef8dc1e0a1<\/a><\/p>\n<p><a href=\"https:\/\/www.everestgrp.com\/2020-08-gpt-3-accelerates-ai-progress-but-the-path-to-agi-is-going-to-be-bumpy-blog-.html\">https:\/\/www.everestgrp.com\/2020-08-gpt-3-accelerates-ai-progress-but-the-path-to-agi-is-going-to-be-bumpy-blog-.html<\/a><\/p>\n<p><a 
href=\"https:\/\/www.theverge.com\/21346343\/gpt-3-explainer-openai-examples-errors-agi-potential\">https:\/\/www.theverge.com\/21346343\/gpt-3-explainer-openai-examples-errors-agi-potential<\/a><\/p>\n<p><a href=\"https:\/\/www.theguardian.com\/commentisfree\/2020\/aug\/01\/gpt-3-an-ai-game-changer-or-an-environmental-disaster\">https:\/\/www.theguardian.com\/commentisfree\/2020\/aug\/01\/gpt-3-an-ai-game-changer-or-an-environmental-disaster<\/a><\/p>\n<p><a href=\"http:\/\/dailynous.com\/2020\/07\/30\/philosophers-gpt-3\/\">http:\/\/dailynous.com\/2020\/07\/30\/philosophers-gpt-3\/<\/a><\/p>\n<p><a href=\"https:\/\/marginalrevolution.com\/marginalrevolution\/2020\/07\/gpt-3-etc.html\">https:\/\/marginalrevolution.com\/marginalrevolution\/2020\/07\/gpt-3-etc.html<\/a><\/p>\n<p><a href=\"https:\/\/artificialintelligence-news.com\/2020\/09\/10\/experts-misleading-claim-openai-gpt3-article\/\">https:\/\/artificialintelligence-news.com\/2020\/09\/10\/experts-misleading-claim-openai-gpt3-article\/<\/a><\/p>\n<p><a href=\"https:\/\/analyticsindiamag.com\/gpt-3-is-great-but-not-without-shortcomings\/\">https:\/\/analyticsindiamag.com\/gpt-3-is-great-but-not-without-shortcomings\/<\/a><\/p>\n<p><a href=\"https:\/\/www.3mhisinsideangle.com\/blog-post\/ai-talk-gpt-3-mega-language-model\/\">https:\/\/www.3mhisinsideangle.com\/blog-post\/ai-talk-gpt-3-mega-language-model\/<\/a><\/p>\n<p><a href=\"https:\/\/venturebeat.com\/2020\/06\/01\/ai-machine-learning-openai-gpt-3-size-isnt-everything\/\">https:\/\/venturebeat.com\/2020\/06\/01\/ai-machine-learning-openai-gpt-3-size-isnt-everything\/<\/a><\/p>\n<p><a href=\"https:\/\/discourse.numenta.org\/t\/gpt3-or-agi\/7805\">https:\/\/discourse.numenta.org\/t\/gpt3-or-agi\/7805<\/a><\/p>\n<p><a href=\"https:\/\/futureofintelligence.com\/2020\/06\/30\/is-this-agi\/\">https:\/\/futureofintelligence.com\/2020\/06\/30\/is-this-agi\/<\/a><\/p>\n<p><a 
href=\"https:\/\/www.quora.com\/Can-we-achieve-AGI-by-improving-GPT-3\">https:\/\/www.quora.com\/Can-we-achieve-AGI-by-improving-GPT-3<\/a><\/p>\n<p><a href=\"https:\/\/bmk.sh\/2020\/08\/17\/Building-AGI-Using-Language-Models\/\">https:\/\/bmk.sh\/2020\/08\/17\/Building-AGI-Using-Language-Models\/<\/a><\/p>\n<p><a href=\"https:\/\/news.ycombinator.com\/item?id=23891226\">https:\/\/news.ycombinator.com\/item?id=23891226<\/a><\/p>\n<p>&nbsp;<\/p>\n<p><span>image source:&nbsp;<\/span><a rel=\"nofollow noopener noreferrer\" href=\"https:\/\/www.youtube.com\/watch?v=ocALxrFa8w8\" target=\"_blank\">Learn English Words: DICHOTOMY<\/a><\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:982110\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: ajit jaokar This blog continues from GPT3 and AGI: Beyond the dichotomy &ndash; part one GPT3 and AGI Let&rsquo;s first clarify what AGI should [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2020\/09\/26\/gpt3-and-agi-beyond-the-dichotomy-part-two\/\">Read 
More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":466,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/3907"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=3907"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/3907\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/459"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=3907"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=3907"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=3907"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}