[ Video Link ]

1) Load Apache logs, limit to 10 records.
2) Load Apache logs, limit to 100 records.
3) Load Apache logs, filter by method == 'GET', group by referer, count each group, order by count descending, limit to 20, and export to Excel.

If it ain't in Excel, it ain't real to most people. This lets you crunch big data down to an Excel-accessible scope, using Hadoop, via Apache Pig.
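The dataflow in step 3 can be sketched on a single machine in plain Python. This is not the Pig script from the video, only a minimal stand-in that mirrors the same filter, group, count, order, and limit steps. It assumes the logs are in Apache's "combined" format, the sample log lines are made up for illustration, the names `top_referers` and `write_csv` are hypothetical, and it writes CSV (which Excel opens directly) rather than a native Excel workbook.

```python
import csv
import io
import re
from collections import Counter

# Assumed input: Apache "combined" log format:
# host ident user [time] "request" status bytes "referer" "user-agent"
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) (?P<bytes>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def top_referers(lines, method="GET", limit=20):
    """Filter by HTTP method, group by referer, count, order desc, limit."""
    counts = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # skip lines that do not parse
        if match.group("method") != method:
            continue  # the "filter by method" step
        counts[match.group("referer")] += 1  # the "group by" + "count" steps
    # Order by count descending (referer breaks ties), then apply the limit.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ordered[:limit]


def write_csv(rows, out):
    """Write (referer, count) rows as CSV so Excel can open the result."""
    writer = csv.writer(out)
    writer.writerow(["referer", "count"])
    writer.writerows(rows)


# Made-up sample lines, for illustration only.
SAMPLE_LINES = [
    '10.0.0.1 - - [10/Oct/2011:13:55:36 -0700] "GET /index.html HTTP/1.1" 200 2326 "http://example.com/a" "Mozilla/5.0"',
    '10.0.0.2 - - [10/Oct/2011:13:55:37 -0700] "GET /about.html HTTP/1.1" 200 1024 "http://example.com/a" "Mozilla/5.0"',
    '10.0.0.3 - - [10/Oct/2011:13:55:38 -0700] "POST /login HTTP/1.1" 302 0 "http://example.com/b" "Mozilla/5.0"',
    '10.0.0.4 - - [10/Oct/2011:13:55:39 -0700] "GET /index.html HTTP/1.1" 200 2326 "http://example.com/b" "Mozilla/5.0"',
]

if __name__ == "__main__":
    buffer = io.StringIO()
    write_csv(top_referers(SAMPLE_LINES), buffer)
    print(buffer.getvalue())
```

In the Hadoop setting, Pig distributes the grouping and counting across the cluster; only the final, small result (at most 20 rows here) needs to land in a spreadsheet.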