1 / 3

2 days of hands-on Apache Spark & Python Workshop/train

Attend 2-day hands-on Workshop/training which covers an overview of Hadoop, Python, Apache Spark, & MachineLearning Libraries on 8th & 9th Sept, 2018 by PakkaIndia.<br>For registration send email to: info@pakkaindia.com <br>or<br>SMS/WhatsApp: 919110709655<br>

adishaa724
Télécharger la présentation

2 days of hands-on Apache Spark & Python Workshop/train

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Machine Learning 2 Day Work Shop Presented By FOR REGISTRATION SEND EMAIL TO : INFO@PAKKAINDIA.COM SMS / WhatsApp : +91 91107 09655

  2. Who should attend: • Software engineers with basic coding experience • Entry level Data scientists • Technical Managers / Team Leads Wo r k s h o p O b j e c t i v e Pre Requisites : System Requirements: • Basic Programming experience • Basic understanding of Big Data • Windows/Linux/Mac • 4 GB RAM • JDK, HADOOP, Python, SPARK, Jupyter This is a 2 day hands-on Apache Spark & Python Workshop / training where participants will be able to perform data analysis This course is 100% practical oriented and hands on which covers an overview of Hadoop, Python and Apache Spark, and Machine Learning Libraries using Spark. Participants will be able to read / write data to/from HADOOP or any Webservers. Also work with REAL TIME Streaming sources. at using Apache Spark. Participants will be able to perform hands-on projects with below objectives: * Using python scripts for web scraping and data wrangling * Utilizing extract-transform-load operations (ETL) * Employing exploratory data analysis (EDA) * Connect to HDFS & HIVE with Spark and perform data processing. * Generate insightful visualizations * Using apache spark to perform quick analysis on structured and unstructured data sets which includes files like CSV, JSON, AVRO, Parquet Focus on Machine Learning At the end of the workshop, Participants will be able to perform : • Write code to implement Transformers using standardization, normalization, one-hot encoding, and binarization. • Create a processing pipeline including transformations, estimations, evaluation of analytical models. • Using Spark Mlib evaluators to Evaluate model accuracy by dividing data into training and test datasets and computing metrics. • Tune training hyper-parameters by integrating cross-validation into Spark MLlib Pipelines. Certification : • “Participation and Course completion Certificate” from pakkaindia.com Graduates in Resume Preparation • Differentiate between supervised and unsupervised Machine Learning problems • Apply various regression and classification models • Train analytical models with Spark MLlib’s Data Frame-based estimators • Implement linear regression, decision trees, logistic regression, and k-means. • Understand purpose of Transformers to perform pre-processing on a dataset prior to training Post Workshop Support: • After the workshop, for 2 weeks duration you can send email to get clarify your doubts on the topics discussed in the workshop Internships & Resume Prep : * Internships for eligible candidates. * Support Job seekers & Fresh ? @ ! FOR REGISTRATION SEND EMAIL TO : INFO@PAKKAINDIA.COM SMS / WhatsApp : +91 91107 09655

  3. FOR REGISTRATION SEND EMAIL TO : INFO@PAKKAINDIA.COM SMS / WhatsApp : +91 91107 09655 Workshop Agenda Day 1 Day 2 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -  Introduction to BIG DATA 9:00  Recap of Machine Learning with Spark - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -  BIG DATA & Machine Learning  How to implement BIG DATA tools  How to implement SPARK  Demonstrate and Discuss 1 project –Participants will be able to see complete code execution. 9:15  Types of machine learning  Supervised Learning  Unsupervised Learning  Reinforcement Learning  How to choose algorithms???  Understand different kinds of data and file formats  JSON  AVRO  PARQUET  CSV  We will provide examples on how you can upload files to HDFS or any webserver directly. Also provide the logic to handle Structured Streaming  Machine Learning – Algorithms and Terminology  Classification  Regression  Cross – Validator  Decision Trees & other key terms  We will provide complete Cheat sheet of Machine Learning algorithms. Also Glossary & Definitions of Machine Learning Algorithms 9:45  Resource planning, Hardware Consideration, Client Interactions in such projects(FAQ) 10:15 SPARK–MlibLibraries–Purpose– Functions / Methods 10:45 - 11:00 Short Break  BIG DATA – Tools & Frameworks FAST FORWARD  Apache Spark, Python  HDFS, Hive, PIG, SQOOP, FLUME  Apache Hadoop (Live Practical examples)  MACHINE LEARNING LIVE EXAMPLE WALK THROUGH:  Evaluating the Model using Evaluators  Putting it all together using MLlib Pipelines  Training a linear Regression model using Estimators  Featurizing a DataFrame using Transformers  PYTHON – FUNDAMENTALS & PROGRAMMING  Demonstrate and discuss major Python end to end Coding solutions  We will provide 2 examples, to practice after the course 11:00  Understanding process flow of Machine Learning algorithms  Purpose of Training sets & Testing sets  How to identify Features  How to prepare MODELS & FIT  How to create Pipelines  How to transform model with training and test data 11:30  Machine Learning algorithms : Classification & Regression  Binary Classification  Multinomial Classification  Logistic Regression 12:00 1:00 – 2:00 Lunch Break  Linear regression  Decision Trees  K – Means Clustering  Caching & Persistence  Make participants hands on with predicting with regression models  Demonstrate and discuss SPARK Machine Learning algorithms end to end project.  We will provide 2 additional examples, which you can practice after the course  Spark – Hands On Fundamentals & Advanced :  RDD, Data Frames, Data Sets  SPARK SQL  SPARK Streaming  Demonstrate and discuss SPARK project end to end Coding solution  We will provide 2 additional examples, which you can practice after the course 2:00  Visualization Techniques  MATPLOT, D3JS, NVD3  Machine Learning with SPARK - String Indexer, Vector Assembler, SVM, Dense /Sparse Vectors  ML Vs MLib  Classification 5:00 Q & A – Closing session 4:00 Request participants to be at venue by sharp 8:15 AM. Breakfast will be completed by 8:45 AM. We will send installation instructions before the session. Participants can contact us for any support over the email. Participants must bring their own laptops.

More Related