主題介紹
這個月是我們讀書肚的阿帕契(Apache)月,跟李蒨蓉沒有關係,是我們要介紹兩個最近紅透半邊天的 Apache Project:分散式/海量資料運算的 Apache Spark ,與 data center 專用的 Operation System,Apache Mesos。
身為矽谷軟體工程師,談到 Big Data Analysis ,Machine Learning,或 Parallelism,你不能不知道當今 Yahoo,eBay,Neflix 爭相採用的 Apache Spark,談到 Data Center 的自動化或是 Resource Management ,你不能不知道現下 Airbnb,Twitter,Apple 趨之若鶩的 Apache Mesos。
這次,讀書肚一次帶給你。
聚會議程
Apache Spark(2:15 pm to 2:45 pm)
Modern Techniques in Big Data Science
Apache Spark is becoming one of the most gossiping and the state-of-the-art framework to conduct data analysis on Hadoop platform. in this talk. I will discuss why PIG is fading out and why Java is not suitable for the big data analysis these days. Next, I will start with explaining the basic idea/motivation behind big data analysis (in my perspective), followed by explaining operations like “map, reduce, fold, join, etc.” and wrapped up with real world examples, including page rank calculation and clustering data sets.
Chu-Cheng Hsieh – Applied Researcher at eBay

Apache Mesos(2:45 pm to 3:15pm)
Introduction to Apache Mesos
Come learn how Apache Mesos, an open source distributed cluster
manager, can allow Twitter to have only three full-time SREs to manage
10s of thousands of nodes running in their datacenters and achieve
high utilization.
Timothy Chen – Distributed Systems Engineer

時間
16 May @ 2pm
地點
超級感謝 Salesforce 的 Benjamin Tsai 大力幫忙喬場地,我們這次在 Salesforce Rincon Center 一樓的 Cafe / meeting room 空間舉行,確切地點請點我看地圖。
到達 Rincon Center 後請來電通知 Winston (四一五,四零一,五一三五),因為門是鎖住的,我們必須要過去接你進來。
請在下圖這個門這邊打給我:

遠道而來到朋友們,Rincon Center 地下室也有停車場,收費 10 元。
參加方式
請愛用 Facebook Event Page 參加,因為要控制人數,所以請想來的人按下『參加』喔:
https://www.facebook.com/events/387598671435364/