{"id":1748,"date":"2019-02-20T06:34:36","date_gmt":"2019-02-20T06:34:36","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2019\/02\/20\/python-machine-learning-libraries\/"},"modified":"2019-02-20T06:34:36","modified_gmt":"2019-02-20T06:34:36","slug":"python-machine-learning-libraries","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2019\/02\/20\/python-machine-learning-libraries\/","title":{"rendered":"Python machine learning libraries"},"content":{"rendered":"<p>Author: ajit jaokar<\/p>\n<div>\n<h1><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/1157930838?profile=original\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/1157930838?profile=RESIZE_710x\" class=\"align-full\"><\/a>Introduction<\/h1>\n<p>This blog is a part of the <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/learn-machinelearning-coding-basics-in-a-weekend-a-new-approach\">learn machine learning coding basics in a weekend<\/a> . We recommend the book <a href=\"https:\/\/jakevdp.github.io\/PythonDataScienceHandbook\/\">Python Data Science Handbook by Jake VanderPlas<\/a><\/p>\n<p>There are four main libraries in Python that you need to know: numpy, pandas, mathplotlib and sklearn<\/p>\n<h1>NumPy<\/h1>\n<p>The Python built-in list type does not allow for efficient array manipulation. The NumPy package is concerned with manipulation of multi-dimensional arrays. NumPy is at the foundation of almost all the other packages covering the Data Science aspects of Python. From a Data Science perspective, collections of Data types like Documents, Images, Sound etc can be represented as an array of numbers. Hence, the first step in analysing data is to transform data into an array of numbers.\u00a0 NumPy functions are used for transformation and manipulation of data as numbers \u2013 especially before the model building stage \u2013 but also in the overall process of data science.<\/p>\n<h1>Pandas<\/h1>\n<p>The Pandas library in Python provides two data structures: The <strong>DataFrame and the Series<\/strong> object. \u00a0The Pandas Series Object is a one-dimensional array of indexed data which can be created from a list or array. The Pandas DataFrames objects are essentially multidimensional arrays with attached row and column labels. A DataFrame is roughly equivalent to a \u2018Table\u2019 in SQL or a spreadsheet. \u00a0Through the Pandas library, Python implements a number of powerful data operations similar to database frameworks and spreadsheets. While the NumPy\u2019s ndarray data structure provides features for numerical computing tasks, it does not provide flexibility that we see in Tale structures (such as attaching labels to data, working with missing data, etc.). The \u00a0Pandas \u00a0library thus provides features for \u00a0data manipulation tasks.<\/p>\n<h1>Matplotlib<\/h1>\n<p>The Matplotlib library is used for data visualization in Python built on numpy. Matplotlib works with multiple operating systems and graphics backends.<\/p>\n<h1>Scikit-Learn<\/h1>\n<p>The Scikit-Learn package provides efficient implementations of a number of common machine learning algorithms. It also includes modules for cross validation, grid search and feature engineering<\/p>\n<p>For more details on how to apply these ideas see\u00a0<\/p>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/learn-machinelearning-coding-basics-in-a-weekend-a-new-approach\">learn machine learning coding basics in a weekend<\/a> .<\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:803693\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: ajit jaokar Introduction This blog is a part of the learn machine learning coding basics in a weekend . We recommend the book Python [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2019\/02\/20\/python-machine-learning-libraries\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":475,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/1748"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=1748"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/1748\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/456"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=1748"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=1748"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=1748"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}