1 / 11

25.Big Data with Hadoop and Cloud Computing

25.Big Data with Hadoop and Cloud Computing

MitSoni
Télécharger la présentation

25.Big Data with Hadoop and Cloud Computing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


    1. Big Data with Hadoop and Cloud Computing

    2. Big Data Processing relevant for Enterprises Big Data used to be discarded or un-analyzed & archived. Loss of information, insight, and prospects to extract new value. How Big Data is beneficial? Energy companies - Geophysical analysis. Science and medicine - Empiricism is growing than experimentation Disney Customer behavior patterns across its stores, and theme parks Pursuit of a Competitive Advantage is the driving factor for Enterprises Data mining (Log processing, click stream analysis, similarity algorithms, etc.), Financial simulation (Monte Carlo simulation), File processing (resize jpegs), Web indexing

    3. Cloud Computing ~ brings economy to Big Data Processing Big Data Processing can be implemented by HPC & Cloud. 1) HPC implementation is very costly w.r.t. CAPEX & OPEX. 2) Cloud Computing is efficient because of its paper use nature. MapReduce programming model is used for processing big data sets. Pig, Hive, Hadoop, are used for Big data Processing Pig - SQL-like operations that apply to datasets., Hive - Perform SQL-like data analysis on data Hadoop - processes vast amounts of data; (Focal point) Use EC2 instances to analyze Big Data in Amazon IaaS. Amazon MapReduce reduces complex setup & Magt.

    4. Cost Comparison of Alternatives

    5. Future Direction Current Experiments & Identified areas Social network analysis Managing Data center Collective Intelligence - Algorithms and Visualization techniques Predictive analytics Accelerators Exploration Apache Whirr - Cloud-neutral way to run services Apache Mahout - Scalable machine learning library Cascading - Distributed computing framework HAMA - define and execute fault tolerant data processing workflows Exploration of LAMP-like stack for Big Data aggregation, processing and analytics

    6. Download with Linkedin Username/Password

    7. Download with Linkedin Username/Password

    8. Download with Linkedin Username/Password

    9. Download with Linkedin Username/Password

    10. Download with Linkedin Username/Password

More Related