1 / 48

Cloud Computing Overview: Big Data and Business Analytics Hsinchun Chen University of Arizona

Cloud Computing Overview: Big Data and Business Analytics Hsinchun Chen University of Arizona. Interesting Questions Cloud Computing Applications Big Data Analytics Business Models ( CIA ). Cloud Computing Applications: Overview and Examples. IQ: How Amazon makes its money?.

zody
Télécharger la présentation

Cloud Computing Overview: Big Data and Business Analytics Hsinchun Chen University of Arizona

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Cloud Computing Overview: Big Data and Business AnalyticsHsinchun ChenUniversity of Arizona

  2. Interesting QuestionsCloud Computing ApplicationsBig Data AnalyticsBusiness Models (CIA)

  3. Cloud Computing Applications:Overview and Examples

  4. IQ: How Amazon makes its money?

  5. Cloud computing: applications, system software, and hardware delivered as services over the Internet. • Service oriented architecture + virtualization + utility computing • Software as a Service (SaaS), Infrastructure as a Service (IaaS), Platform as a Service (PaaS) • From web services to cloud computing applications • Moving towards cloud applications and cloud business models, e.g., SaleForce.com, Apple iTune, Amazon Cloud Computing Overview

  6. Major Could Computing Platforms • Amazon Elastic Compute Cloud (EC2):LAMP (Linux, Apache, mySQL, and PHP) stack • Google App Engine:Java and Python runtime, Java Persistence API (JPA), Google Bigtable, File systems; Hadoop, MapReduce • Windows Azure: .Net, MS SQL, SharePoint

  7. E-Commerce: B2C, life style & entertainment, global supply-chain, banking, telecommunications, IT hosting, business intelligence and analytics • E-Government: government data sources, services • E-Education: online education content delivery • E-Security: cybersecurity, intelligence • E-Health: healthcare big data, healthcare 2.0; genomics + EHR Emerging Applications

  8. National Electronic Health Record Data Bank, Singapore: MOH + Accenture, August 2010; healthcare management, quality and performance management, EHR information aggregation, patient self management, decision support • E-Health, E-Health Cloud, England: Chelsea Westminster Hospital + Flexiant, July 2011, patient EHR access • CareStream Cloud, US: Carestream Health (Onex + Kodak), 2009; health imaging sharing, 1B medical images, health cloud SaaS vendor • Taiwan Smart Health Cloud, NTU & NCKU (Sources: NTU Health Cloud proposal) Selected Health Cloud Initiatives

  9. IQ: What’s the difference between 2005 and 2012 for web computing?

  10. Web Computing and Mining • Emerging web applications  business models • Web services, APIs, mashups  cloud & mobile computing • Business analytics  Data, text and web mining

  11. Web Services and Computing (No Cloud), 2005 (Web 2.0)-2011

  12. 50 Projects, 2005-2012(“Business Web Mining Using Amazon, Google, eBay, and Google”) • E-commerce and e-Services: iRelocateRealTomatoesSmallBHHobbyCentralNewPlaceSeek College AdvisorFriendly GifterClipperGottaCouchSkiStopvTrack Barter BayLink-USSmart Gift CardTimely BidTucson Gamer CaféTV and More DeliverablesCellphone Intelligent AuctioningTucson Book ExchangeSciBubbleWish SkyGiftChannelPriceSmartWetYourWhistle • Life Style and Entertainment: BetSmartXTREME F1MLB100YardsCricWeb iBollywoodSa Ri Ga MaWOWBollywoodFunzicHinduShrines IndiapaaruNachBaliyeMovie Location QuestRemakesSugarSuite MusicBoxArtist ConnectionConcertoStar Search • Government and Education: RepCheckSmallNGreenCarsChange of BaseiDogTasty ParkiSupport

  13. SmallNGreenCars

  14. SmallNGreenCars

  15. By Kumar Vakeel, Kunal Jain, Neeraj Munshi; MS MIS, Spring 2010 • One-stop portal for green cars information and resources • Unique Concept • Global customers • Youtube vehicle videos • Flickr vehicle photos • Google Maps and Local Search • Google visualization • RSS feeds of global vehicle news • Facebook recommendation from friends • Yahoo Finance for currency exchange • Google Translate for web pages • Recommendation System • Fuel Efficiency Challenge SmallNGreenCars

  16. SmallNGreenCars

  17. Sa Ri Ga Ma

  18. Sa Ri Ga Ma

  19. Sarigama.com latest news and RSS Feeds • Artist information • Transliteration • Music play and video • Shopping • Lessons and Library • Concert locator • Forums • Interactive Features • Tag Clouds • Lyrics Recommender system Sa Ri Ga Ma • Mahalakshmi Sundararajan, Pavithra Ravi, Sahana Nagaraja; Spring 2010 • Carnatic Music: One of the two main genres of Indian classical music; Mostly performed vocally • Sarigama.com: one stop information portal for carnatic music

  20. Sa Ri Ga Ma

  21. Web Services, Cloud Computing, and Mobile Web, 2012 (Web 3.0)

  22. 25 Projects, 2012Cloud and Mobile Computing • E-commerce and e-Services: GamerzLykMeMobileAppPortalGemstonesPersonalInvestment iScreamiRace SeeMeSocialAZRegionTrendHelpMeAZ • Health & Life Style: EatRightOrganiCookRoadTripXtravelWreckDiversVoiceOfNatureHealthMiners HelpAsthmaDiabeatUSHikeAdayYogaWorldBikersParadiseYogaWorldBikersParadise

  23. OrganiCook

  24. OrganiCook • By Zilong Chang, Mengwen Cheng, Yajie Wang, and Haiqing Wu, Spring 2012 • One-stop portal for healthy foods • Organic food supplier location • Different health concerned recipe catalogs • Integrate healthy content with social media • Text mining for cookware recommendation • Mark allergens among ingredients • Provide health news • Advertisement • Unique recommendation system • Amazon EC2 Cloud server • Intetergrate Mahout with Hadoop

  25. OrganiCook

  26. OrganiCook User Cloud Application Server Apache Tomcat J2EE REST API Browser Internet Connection Amazon EC2 Mahout Taste Data Mining JavaScript API MySQL 5.5 API Servers Database server

  27. EatRight

  28. EatRight • By Jim Marquardson, Justin William, Dave Wilson, and Mark Grimes, Spring, 2012 • Health & nutrition mobile site • True SoLoMo (Web 3.0) • Nutrition based meal shopping • Capturing user preferences: “Eat This” button • Directed search advertising rates • Targeted ads based on nutrition preferences and location • EatRight API • Twitter Sentiment • PCI Compliant Credit Card Processing • Amazon EC2 Cloud • Android Mobile App (iOS too!)

  29. EatRight

  30. Big Data & Business Analytics

  31. IQ: Size (storage) of LOC book collection?

  32. IQ: What is a Yottabyte & who owns it?

  33. The Data Deluge (Big Data) • The Economists, March 2010 • LOC total book collection 15 TBs • Google processes 10 PBs per day • Internet traffic 667 Exabytes by 2013, Cisco • Total amount of world information in 2010, 1.2 Zettabyte • KB-MB-GB-TB-PB-EB-ZB-Yottabyte • E-Commerce, Government, Health, Security applications: many with TB/PB of valuable content from customers, citizens, patients, etc.

  34. BI & Analytics: The Market • $3B BI revenue in 2009 (Gartner, 2006); $9.4B BI software M&A spending in 2010 and $14.1B by 2014 (Forrester) • IBM spent $14B in BI in five years; $9B BI revenue in 2010 (USA Today, November 2010); 24 acquisitions, 10,000 BI software developers, 8,000 BI consultants, 200 BI mathematicians  Acquired i2/COPLINK in 2011

  35. BI & Analytics: Definition and Components • BI and Analytics refers to: (1) the technologies, systems, practices and applications that (2) analyze critical business data to (3) help an enterprise better understand its business and market.” • Core technologies: data warehousing, Extraction, Transformation, and Load (ETL); Business Performance Management (BPM), visual dashboards; data and text mining, social network analysis • BI 2.0 & 3.0 research: web analytics, web 2.0; in-memory and real-time BI; web 3.0, cloud computing, Hadoop, MapReduce; mobile computing, stream data mining

  36. Big Data Analytics Research at UA/AI Lab • Applications/problems: digital libraries, search engines, biomedical informatics, healthcare data mining, security informatics, business intelligence • Approaches: web collection/spidering, databases, data warehousing, data mining, text mining, web mining, statistical NLP, ontologies, social media analytics, interface design, information visualization, economic modeling, assessment • Structure: federal funding, director, affiliated faculty, post-docs, Ph.D./MS/BS students  commercialization • Major phases: DLI  COPLINK  Dark Web  DiabeticLink

  37. Business Models

  38. IQ: What is “CIA” and their differences?

  39. CIA in the Global IT Landscape • Central Intelligence Agency; Culinary Institute of America • Chinese: math/science, team player, IT/hardware/web, China market (China) • Indians: math/science, entrepreneurial spirit, English • Americans: English, entrepreneurial spirit, IT/software, business development, market (US), VC access ($)

  40. My COPLINK Experience • Taiwan/US Training: NCTU (math)  SUNY Buffalo (MBA)  NYU (AI)  U of Arizona (top 3) • AI Lab: Digital Library  COLINK  Dark Web  DiabeticLink • COPLINK federal funding ($4M), NSF/NIJ, 1997-2002 • COPLINK commercialization ($4.6M), angels/VCs (Taiwan, CA, AZ), 2000 & 2003 • Customer sales ($30M), 4,500 agencies, 120 FTEs, 2000-2011 • M&A Exit, Silverlake/i2/IBM acquisition, 2009 (i2), 2011 (IBM); $500M valuation

  41. COPLINK Identity Resolution and Criminal Network Analysis (DHS) • Funding: NSF, DOJ, DHS ($4M), VCs ($4.6M); Digital Government • Publications: ACM TOIS, CACM, IEEE TKDE, IEEE IS, JASIST, DSS • Impact: 3500 agencies, 25 NATO countries, 1M users  public safety

  42. The New York Times, November 2, 2002 COPLINK assisted in DC sniper investigation ABC News  April 15, 2003 Google for Cops: Coplink software helps police search for cyber clues to bust criminals Newsweek Magazine,  March 3, 2003 A computerized way for police to coordinate crime databases Washington Post, March 6, 2008, COPLINK in use in 3,500 police agencies in US! COPLINK acquired by i2 (Silver Lake) in 2009; i2/COPLINK acquired by IBM in 2011 for $500M

  43. IT Business Models: Some Thoughts • Startup Phase: business ideas (product and market), team (founders & mentors), share structure (shares, directors, options; legal/CPA), business plan (short plan, good introduction), funding (government, angels, VCs, family)  Year 0, 1-3 founders, $250K funding (IT/cloud) • Early Phase: first product, product positioning, team building, initial sales  Years 1-3, $500K sales • Growth Phase: products plan, strong sales team, sustainable revenues, unique IPs (SW, content), loyal customers  Years 3-8, $10M sales • Exit Phase: IPO or M&A (partners), when ($20M+), next venture Taking risks!

  44. Pain, Sorrow, and Regret • Loss of family time/life (but never money) • Managing university obligations and COI • University bureaucracy, Office of Technology Transfer (OPTT) • Lawyers, accountants are expensive • Chasing angels/VCs (40 frogs  1 prince) • Office, employees, products • Selling products (becoming a vendor) • Burning cash • Bubble burst • Raising second round funding when you are down ($2M) • Board room yelling matches • University accusations • Losing control and shares • Anti-dilution clause (losing $60M for the $2M you never used)

  45. hchen@eller.Arizona.edu http://ai.Arizona.edu

More Related