Search: Parallel Data Processing

32 results

Results

Title/Author Year Citations Added
He, B; Yang, M; Guo, Z; Chen, R; Su, B; Lin, W; Zhou, L
Comet: Batched Stream Processing in Data Intensive Distributed Computing
2008 Apr10
Kolb, L; Thor, A; Rahm, E
Load Balancing for MapReduce-based Entity Resolution
2012 Nov11
Kolb, L; Thor, A; Rahm, E
Block-based Load Balancing for Entity Resolution with MapReduce
2011 Aug11
Isard, M; Budiu, M; Yu, Y; Birrell, A; Fetterly, D
Dryad: Distributed data-parallel programs from sequential building blocks
2007 Apr10
Zaharia, Matei; Konwinski, Andy; Joseph, Anthony D.; Katz, Randy; Stoica, Ion
Improving MapReduce performance in heterogeneous environments
2008 Apr10
Kwon, YongChul; Balazinska, Magdalena; Howe, Bill; Rolia, Jerome
SkewTune: Mitigating Skew in MapReduce Applications
2012 May12
Thusoo, Ashish; Sarma, Joydeep Sen; Jain, Namit; Shao, Zheng; Chakka, Prasad; Anthony, Suresh; Liu, Hao; Wyckoff, Pete; Murthy, Raghotham
Hive - A Warehousing Solution Over a Map-Reduce Framework
2009 Oct09
Borthakur, Dhruba; Sarma, Joydeep Sen; Gray, Jonathan; Muthukkaruppan, Kannan; Spiegelberg, Nicolas; Kuang, Hairong; Ranganathan, Karthik; Molkov, Dmytro; Menon, Aravind; Rash, Samuel; Schmidt, Rodrigo; Aiyer, Amitanand
Apache Hadoop Goes Realtime at Facebook
2011 Oct11
Olston, Christopher; Chiou, Greg; Chitnis, Laukik; Liu, Francis; Han, Yiping; Larsson, Mattias; Neumann, Andreas; Rao, Vellanki B. N.; Sankarasubramanian, Vijayanand; Seth, Siddharth; Tian, Chao; ZiCornell, Topher; Wang, Xiaodan
Nova: Continuous Pig/Hadoop Workflows
2011 Aug11
Eltabakh, MY; Tian, Y; Özcan, F; Gemulla, R; Krettek, A; McPherson, J
CoHadoop: flexible data placement and its exploitation in Hadoop
2011 Aug11
Jahani, Eaman; Cafarella, Michael J.; Ré, Christopher
Automatic Optimization for MapReduce Programs
2011 Mar11
Dittrich, J; Quiane-Ruiz, J; Jindal, A; Kargin, Y; Setty, V; Schad, J
Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing)
2010 Dec10
Nykiel, T; Potamias, M; Mishra, C; Kollios, G; Koudas, N
MRShare: Sharing Across Multiple Queries in MapReduce
2010 Dec10
Yu, Y; Isard, M; Fetterly, D; Budiu, M; Erlingsson, Ú; Gunda, PK; Currey, J
DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language
2008 Apr10
Afrati, Foto N.; Ullman, Jeffrey D.
Optimizing Joins in a Map-Reduce Environment
2010 Aug11
Jiang, D; Ooi, BC; Shi, L; Wu, S
The Performance of MapReduce: An in-depth Study
2010 Dec10
Okcan, Alper; Riedewald, Mirek
Processing theta-joins using MapReduce
2011 Aug11
Pike, R; Dorward, S; Griesemer, R; Quinlan, S
Interpreting the data: Parallel analysis with Sawzall
2005 Oct10
Logothetis, Dionysios; Yocum, Kenneth
Ad-hoc data processing in the cloud
2008 Oct09
Tsangaris, M.; Kakaletris, G.; Kllapi, H.; Papanikos, G.; Pentaris, F.; Polydoras, P.; Sitaridi, E.; Stoumpos, V.; Ioannidis, Y.
Dataflow Processing and Optimization on Grid and Cloud Infrastructures
2009 Oct09
Yang, Hung-chih; Dasdan, Ali; Hsiao, Ruey-Lung; Parker, D. Stott
Map-reduce-merge: simplified relational data processing on large clusters
2007 Oct09
Beyer, Kevin; Ercegovac, Vuk; Krishnamurthy, Rajasekar; Raghavan, Sriram; Rao, Jun; Reiss, Frederick; Shekita, Eugene J.; Simmen, David; Tata, Sandeep; Vaithyanathan, Shivakumar; Zhu, Huaiyu
Towards a scalable enterprise content analytics platform
2009 Oct09
Yu, Yuan; Isard, Michael; Fetterly, Dennis; Budiu, Mihai; Erlingsson, Ulfar; Gunda, Pradeep Kumar; Currey, Jon; McSherry, Frank; Achan, Kannan; Poulain, Christophe
Some sample programs written in DryadLINQ
2008 Apr10
DeWitt, D; Stonebraker, M
MapReduce: A major step backwards
2008 Oct09
Stonebraker, Michael; Abadi, Daniel; DeWitt, David J.; Madden, Sam; Paulson, Erik; Pavlo, Andrew; Rasin, Alexander
MapReduce and parallel DBMSs: friends or foes?
2010 Jan10
Kolb, L; Thor, A; Rahm, E
Multi-pass sorted neighborhood blocking with MapReduce
2011 Aug11
Kolb, L; Köpcke, H; Thor, A; Rahm, E
Learning-based Entity Resolution with MapReduce
2011 Aug11
Afrati, Foto N.; Sarma, Anish Das; Menestrina, David; Parameswaran, Aditya; Ullman, Jeffrey D.
Fuzzy Joins Using MapReduce
2012 Sep12
White, Tom; Gray, Jonathan; Stack, Michael
Hadoop: The Definitive Guide - MapReduce for the Cloud
2009 Jan10
Lu, Jiaheng
Introduction to cloud computing
2009 Jan10

Hive User Meeting August 2009 Facebook
2009 Jan10
Gates, Alan F.; Natkovich, Olga; Chopra, Shubham; Kamath, Pradeep; Narayanamurthy, Shravan M.; Olston, Christopher; Reed, Benjamin; Srinivasan, Santhosh; Srivastava, Utkarsh
Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience
2009 Apr10