All about Data

Google Alert – mapreduce

Posted by: Ajay Ohri on: January 28, 2010

Notes about Hadoop « I’m just a simple DBA on a complex production
By prodlife
It is more similar to an operating system: Hadoop has a file system (HDFS) and a job scheduler (Map-Reduce). Both are distributed. You can load any kind of data into Hadoop. It is quite popular – the last Hadoop Summit had 750 attendees
I’m just a simple DBA on a complex… – http://prodlife.wordpress.com/
HBase Digest, January 2010 « Sematext Blog
By abaranau
Cloud MapReduce faster/smaller/simpler than Hadoop MapReduce http://code.google.com/p/cloudmapreduce/ but ec2-specific #hadoop #mapreduce 6 days ago; Java 6 update 18 is out – lots of GC and HotSpot fixes and improvements
Sematext Blog – http://blog.sematext.com/
The Ebooks Nest : Free Ebooks Download: CouchDB: The Definitive Guide
By sruthin
Interact with CouchDB entirely through HTTP using its RESTful interface * Model data as self-contained JSON documents * Handle evolving data schemas naturally * Query and aggregate data in CouchDB using MapReduce views
The Ebooks Nest : Free Ebooks Download – http://www.topviewed.info/
a little something something…: Unit Testing the Hadoop WordCount
By Alex Parvulescu
This will provide you with means to start and stop a mini-HDFS and a mini-MapReduce cluster, in other words, everything you need to run your tests locally. Let me run that by you again, because for me it was not all that clear at the
a little something something… – http://blog.pfa-labs.com/

Google Web Alert for: mapreduce

Re: avro in mapreduce
Getting Avro types passing through MapReduce is a good goal. > > I apologize for not seeing the issue before it was committed. I > accept some of the blame

Google Alert – mapreduce

Posted by: Ajay Ohri on: January 27, 2010

Google News Alert for: mapreduce

Entries tagged with “machine-learning” from O’Reilly Radar
O’Reilly Radar
If you desire (near) real-time analysis, traditional SQL databases and MapReduce systems are batch-oriented (load all the data, then analyze), and might not

Google Blogs Alert for: mapreduce

Cloud MapReduce « DECISION STATS
By Ajay Ohri
Cloud MapReduce was initially developed at Accenture Technology Labs. It is a MapReduce implementation on top of the Amazon Cloud OS. By exploiting a cloud OS’s scalability, Cloud MapReduce achieves three primary advantages over other
DECISION STATS – http://decisionstats.wordpress.com/
computer: Re: [sage-devel] proposal to remove dsage from sage
By computer
Running on a small home-brew cluster should a) fall outside of Sage’s scope, and b) use existing libraries rather than home-cooked to a much larger extent than DSage does (use OpenMPI or Google’s Map/Reduce or something in-between).
computer – http://computer-yocher.blogspot.com/
Research Plan | sidudun
By sidudun
I want to explore MapReduce, a programming model for processing a large scale of data in a distributed environment. I heard about this model from some mailing lists and websites, surprised that the paper [pdf], the lecture notes and
sidudun – http://arifn.web.id/blog/
High Scalability – High Scalability – Applications Become Black
By Todd Hoff
A smart grid rents their backend sensor cluster for a map-reduce job that works very well on underpowered CPUs. A queue service offers a good price for reliable low latency queuing so some of your queue load is offloaded to that service
High Scalability – http://highscalability.com/
app expressions: Google’s App Engine
By Gary Mawdsley
There are a number of useful peripheral services like CloudFront and Map Reduce. In Amazon’s model the app is deployed atop a VMware-style machine image. We can deploy Ruby and JVM based apps there within a few minutes – moreover
app expressions – http://mawdo.blogspot.com/

Google Web Alert for: mapreduce

Xindicate PDM: Message Forum: Key-value stores and MapReduce
Secondly, Google’s MapReduce framework begs for some attention, and there are a number of resources available to study at Google Code University:
Nabble – Mahout Developer List – [jira] Created: (MAHOUT-237) Map
[jira] Created: (MAHOUT-237) Map/Reduce Implementation of Document Vectorizer. This is a pure bag-of-words based Vectorizer written in Map/Reduce.
Amazon Web Services Developer Community : flakyness accessing
Thread: flakyness accessing mapreduce service. Posted: Jan 9, 2010 4:46 PM PST
Ramon Chen: Cloud ‘N Clear « Parallel DBMS’, Hadoop and MapReduce
This new recently published article MapReduce and Parallel DBMS, Friends or Foe? is an excellent read and reminded me of one of my earlier posts around

Google Alert – mapreduce

Posted by: Ajay Ohri on: January 27, 2010

Google Blogs Alert for: mapreduce

qizmt – C# MapReduce from MySpace « Polymath Aspirations
By Terence
qizmt – C# MapReduce from MySpace. By Terence. qizmt – Project Hosting on Google Code.
Polymath Aspirations – http://polymathaspirations.wordpress.com/
Ebookz.ir-The world leading online source of ebooks-Free
By admin
Intermediate CouchDB concepts, including views, the REST API, JSON, map/reduce, load balancing, replication, and scalability. How to develop full CouchDB applications to get the reader up and running with CouchDB development as quickly
Powered by Ebookz.ir – http://ebookz.ir/
Apress – Beginning CouchDB « Seif Sallam’s Blog
By Seif Sallam
The book helped me understand Views and how to use Map/Reduce to get data, also made me see the difference between SQL and Key/Value database systems, and many other things. I also learned new things like using cURL, and practice some
Seif Sallam’s Blog – http://seiftopia.wordpress.com/
Strange Adventures in Infinite Space: New Years Python Meme
By Turd Flop Down M’leg
Hadoop and map/reduce. I’m already familiar with the concepts, but I’d like to get some actual experience with it; pyGTK and pyGame. I’d really like to get into gui programming.. even tho cli rules. Ctypes.. I’ve been meaning to read up
Strange Adventures in Infinite Space – http://tfdml.blogspot.com/

Google Web Alert for: mapreduce

apache’s hadoop-mapreduce at
Mirror of Apache Hadoop MapReduce — Read more MAPREDUCE-1310. CREATE TABLE statements for Hive do not correctly specify delimiters.
Error in using Hadoop MapReduce in Eclipse – Stack Overflow
When I executed a MapReduce program in Eclipse using Hadoop, I got the below error. It has to be some change in path, but I’m not able to figure it out.
MapReduce Case Study – Fraud Detection Video
Watch MapReduce Case Study – Fraud Detection and hundreds of other videos about tech, mapreduce, hadoop, asterdata, aster data, online gaming,
master/slave with mapreduce and update – mongodb-user | Google Groups
I use map/reduce to process these data. During the map/reduce process I update some ... My question is if I run the map/reduce process on the slave will my

Google Alert – mapreduce

Posted by: Ajay Ohri on: January 27, 2010

Google News Alert for: mapreduce

25 Years of Big Data: From SQL To The Cloud
SYS-CON Media (press release)
Their solution was to adopt a simple parallel programming framework, MapReduce, in place of SQL. MapReduce and its open source version Hadoop are now widely

Google Blogs Alert for: mapreduce

25 Years of Big Data: From SQL To The Cloud | Cloud Computing Journal
If SQL was the first generation Big Data tool, and MapReduce/Hadoop was the second generation tool, what might a third generation tool look like? To answer this, we need to look at the areas in which MapReduce/Hadoop are weak – those
Latest News from Cloud Computing Journal – http://cloudcomputing.sys-con.com/
DB Optimizer: Hadoop
By Kyle Hailey
For certain workloads, MapReduce and Hadoop can outperform even the most expensive commercial RDBMS software and associated hardware – and can do so using much cheaper commodity hardware, and without expensive software licenses.
DB Optimizer – http://db-optimizer.blogspot.com/
Electric Politician: MapReduce In Groovy
By Dom Farr
I came across an old post on Joel on Software about MapReduce and functions as first class citizens. You can read it here. I thought I’d have a go at doing the examples in Groovy. I know these examples have been abstraction already in
Electric Politician – http://domfarr.blogspot.com/
HADOOP FOR THE LONE ANALYST, WHY AND HOW | StyleFeeder Tech Blog
By Ben Clark
The Sqoop (sql-to-hadoop) project (in the contrib area of mapreduce) provides ways to do this with both jdbc and mysqldump, if you have a running MySQL instance. You could certainly get Sqoop to do what we did, but we actually went
StyleFeeder Tech Blog – http://blog.tech.stylefeeder.com/
Highlights – RAD Lab
By Fox
RAIN, Spikes, statistical map-reduce, are all different ways of dealing with workload modeling/scheduling; what happens if combined? Lots of passive log collection/analysis, why no active probing? HP has done some work on this in the
RAD Lab – Recent changes [en] – http://radlab.cs.berkeley.edu/wiki/Special:Recentchanges

Google Web Alert for: mapreduce

First look: Amazon brings MapReduce to the Elastic Cloud | ITworld
Based on Hadoop, MapReduce equips users with potent distributed data-processing tools.
Is MapReduce right for me? – Stack Overflow
I am working on a project that deals with analyzing a very large amount of ... MapReduce is good for scaling the processing of large datasets,
First look: Amazon brings MapReduce to the Elastic Cloud | The
Have you got a few hundred gigabytes of data that need processing? Perhaps a dump of radio telescope data that could use some combing through by a squad of
MapReduce « Agile Cat — Azure & Hadoop — Talking Book
Watching tags like #Hadoop, #MapReduce, and #NoSQL on Twitter, this is the kind of information flying around, and I can't help feeling a huge gap.

Google Alert – mapreduce

Posted by: Ajay Ohri on: January 27, 2010

Google News Alert for: mapreduce

Excel Meets The Cloud
SYS-CON Media (press release)
By Bill McColl. Faced with this information explosion, experienced programmers are now using parallel processing tools such as MapReduce/Hadoop,
Enhanced Financial Applications for Clients of National Bank Direct Brokerage
SYS-CON Media (press release)
Faced with this information explosion, experienced programmers are now using parallel processing tools such as MapReduce/Hadoop, rather than SQL databases,

Google Blogs Alert for: mapreduce

Map Reduce — How Cool is That? | Webs Developer
By web slinger
From time-to-time I hear a few mentions of MapReduce; up until recently, I avoided looking into it. This month’s CACM, however, is chock-full of MapReduce.
Webs Developer – http://www.websdeveloper.com/
Oracle to Present at the Largest Cloud Computing Event in the
Faced with this information explosion, experienced programmers are now using parallel processing tools such as MapReduce/Hadoop, rather than SQL databases, to analyze large repositories of stored, historical data.
Latest News from Cloud Computing Journal – http://cloudcomputing.sys-con.com/
DbWorld » Blog Archive » [Dbworld] CFP: WWW 2010 International
By admin
based on MapReduce, use racks of commodity servers with locally attached storage and are able to scale out quickly at low cost. In general, there is a need to assemble resources on demand – the motivation for Cloud Computing.
DbWorld – http://dbworld.lukasblunschi.ch/blog/
Advanced Analytics Predictions For 2010 « JCC.COM
By jccavalcanti
To support heterogeneous interoperability for in-database and in-cloud analytics, open development frameworks, especially MapReduce and Hadoop, will be adopted broadly by data warehousing and analytics tools vendors. In the coming year,
JCC.COM – http://jccavalcanti.wordpress.com/
Code Collaboration Twitter Tweets about Programming as of January
timekord: Reading about MapReduce http://bit.ly/8UDuIC a programming framework useful to scale and cloud. 2010-01-12 · Reply · milfredd: it’s too fucken early for programming. bye. 2010-01-12 · Reply
Code Collaboration – http://codecollaboration.com/

Google Web Alert for: mapreduce

First look: Amazon brings MapReduce to the Elastic Cloud – SFGate
2009-12-31 15:44:00 PST — Have you got a few hundred gigabytes of data that need processing? Perhaps a dump of radio telescope data that could use some
Hadoop Map/Reduce versus DBMS, benchmarks at Netuality
On the same 100-node RedHat cluster they compared Vertica (a well-known MPP), “plain” Hadoop with custom-coded Map/Reduce tasks and an unnamed DBMS-X

MapReduce News

Posted by: Ajay Ohri on: January 27, 2010

Google News Alert for: mapreduce

Oracle to Present at the Largest Cloud Computing Event in the World
SYS-CON Media (press release)
He has a PhD in Parallel Systems from Cambridge University, where he built a system for processing massive data sets using a MapReduce framework.

Google Blogs Alert for: mapreduce

16 Different Clones You Can Build with Drupal | Key-value, Map
By mikel
Key-value, Map-reduce & SEO: a key-value and map-reduce open source technical blog.
Key-value, Map-reduce & SEO – http://www.conby.com/blog/
2009 – the “Long View”: Hadoop and Cloudera | The Virtualization
By Mike
In the long term, 2009 is likely to be known as the year that a commercial framework emerged around the Open source Hadoop framework for map-reduce.
The Virtualization Practice – http://www.virtualizationpractice.com/blog/
Technical Support Engineer job in Aster – San Carlos
By raju
Aster Data Systems located in San Carlos, Ca is a proven leader in high-performance analytic database systems for data warehousing – the first DBMS to tightly integrate SQL with MapReduce – providing deep insights on data analyzed on
Computer Jobs Blog – http://hotjobs.taragana.com/
HadoopHackDay was a major hit
By jon
Elastic MapReduce (from Amazon) is a great way to quickly get started with Hadoop (everyone was up and running in less than an hour, without installing anything on their laptops!). However, the versions of Hadoop and Pig that come with
Jonathan Boutelle’s home on the net – http://www.jonathanboutelle.com/
Eamonn O’Brien-Strain » How you might create a Scala matrix
By eamonn
The toStr returns a string representation of the matrix in a nice tabular format that is much easier to read than the default toString method of List[List[Double]]. Note how this is done with two nested map / reduce pairs.
Eamonn O’Brien-Strain – http://www.eamonn.org/blog/
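The "two nested map / reduce pairs" idea in the snippet above can be sketched roughly as follows. This is a Python illustration, not the author's Scala code: the name to_str, the 8-character column width, and the two-decimal formatting are all assumptions made for the example.

```python
from functools import reduce


def to_str(matrix):
    """Format a non-empty list of non-empty rows of floats as a table.

    Inner pair: map each cell to a fixed-width string, reduce by joining
    with spaces. Outer pair: map each row to its line, reduce by joining
    with newlines. (Hypothetical helper; widths are an arbitrary choice.)
    """
    lines = map(
        lambda row: reduce(
            lambda left, right: left + " " + right,
            map(lambda cell: f"{cell:8.2f}", row),
        ),
        matrix,
    )
    return reduce(lambda top, bottom: top + "\n" + bottom, lines)


if __name__ == "__main__":
    print(to_str([[1.0, 2.0], [3.0, 4.5]]))
```

In everyday Python a pair of str.join calls would be the usual choice; the explicit map and reduce calls are kept only to mirror the structure the post describes.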

Google Web Alert for: mapreduce

Map Reduce

Google Alert – mapreduce

Posted by: Ajay Ohri on: January 27, 2010

Google News Alert for: mapreduce

Distributed data caches speed cloud applications
SearchSOA
He says that MapReduce, a method of analysis that divides a computation among several servers and then combines the results, can be more easily deployed
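The description above (divide a computation, then combine the results) can be made concrete with a small single-process sketch. It is a toy word count, assuming each string in chunks stands in for the share of input one server would receive; the function names map_phase, shuffle, reduce_phase and word_count are hypothetical and do not come from Hadoop or any other framework.

```python
from collections import defaultdict
from functools import reduce


def map_phase(chunk):
    """Map step: emit a (word, 1) pair for every word in one chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Group emitted values by key, as a framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce step: combine the values collected for each key into a total."""
    return {
        key: reduce(lambda total, count: total + count, values)
        for key, values in groups.items()
    }


def word_count(chunks):
    """Run map over every chunk, then shuffle and reduce the results."""
    mapped = [pair for chunk in chunks for pair in map_phase(chunk)]
    return reduce_phase(shuffle(mapped))


if __name__ == "__main__":
    print(word_count(["the cat sat", "the dog sat down"]))
```

In a real cluster the map calls would run on separate machines and the shuffle would move data over the network; here everything runs in one process purely to show the shape of the computation.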

Google Blogs Alert for: mapreduce

a little something something…: Troubleshooting the Hadoop
By Alex Parvulescu
First thing: set the Hadoop install directory in Eclipse: just go to Window – Preferences – Hadoop Map/Reduce and fill in the field Hadoop installation directory with the proper path. The trouble starts when you try to add a new Hadoop
a little something something… – http://blog.pfa-labs.com/
Hive @Facebook | Scalable web architectures
By Royans
It was this that eventually forced Facebook to build a new way of querying data from Hadoop which doesn’t require writing map-reduce jobs in Java. That quickly led to the development of Hive, which does exactly what it was set out to
Scalable web architectures – http://www.royans.net/arch/
Logging: Unsexy, Important, and now Usable. | Road to Failure
By Bradford
Send alerts when something looks “strange”; Run Hadoop/MapReduce scripts to provide interesting analytics. If you had a log search engine, with various utilities built on top of it, you’d have an easy way to see what’s going on in the
Road to Failure – http://www.roadtofailure.com/
Google: Cluster Computing and MapReduce › ec2base
By admin
Google: Cluster Computing and MapReduce. http://code.google.com/edu/submissions/mapreduce-minilecture/listing.html. This submission contains video lectures and related course materials from a series of lectures that was taught to Google
ec2base – http://php-app-engine.com/
Never Mind the Others: Here’s Silicon Valley – ReadWriteStart
By Chris Cameron
Innovations such as Cloud/MapReduce, NoSQL/BigTable, so on so forth. It is easier to hire 3l33+ computer scientists in the Valley, vs. hiring normal computer guys outside of the Valley. So if the startup is one that is doing something
ReadWriteWeb – http://www.readwriteweb.com/

Google Web Alert for: mapreduce

[jira] Commented: (MAPREDUCE-1367) LocalJobRunner should support
[ https://issues.apache.org/jira/browse/MAPREDUCE-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12799448#
PeteSearch: MapReduce for Idiots
Photo by Stuart Pilbrow I’ll admit it, I was intimidated by MapReduce. I’d tried to read explanations of it, but even the wonderful Joel Spolsky left me
IDEALS @ Illinois: Breaking the MapReduce Stage Barrier
Thus, we develop a method to break the barrier in MapReduce in a way that improves efficiency. Careful design of our barrier-less MapReduce framework

Google Blog Alert- MapReduce

Posted by: Ajay Ohri on: January 27, 2010

Google Blogs Alert for: mapreduce

how to compile python .py file to .pyc file | Key-value, Map
By mikel
Key-value, Map-reduce & SEO: a key-value and map-reduce open source technical blog.
Key-value, Map-reduce & SEO – http://www.conby.com/blog/
@Gridify Cloud Computing: Introducing GridGain 3.0
By dsetrakyan
Full integration with GridGain MapReduce implementation. Integration with Hibernate Level2 cache. One of the richest Cache API on the market including support for functional programming with closures and predicates.
@Gridify Cloud Computing – http://gridgain.blogspot.com/
ESP – Product Engineering – Platform and Software Products
By Jayanti Vemulapati
MapReduce is a programming model and software framework for writing applications that rapidly process vast amounts of data in parallel on large clusters of compute nodes http://wiki.apache.org/hadoop/. One good example of an enterprise
ESP – Product Engineering –… – http://www.infosysblogs.com/engineering-software/
Programming Hadoop in Netbeans | Webs Developer
By web slinger
In this post, I’ll tell you step-by-step how to use Netbeans to develop a Hadoop MapReduce job. I’m using Netbeans 6.8 in Ubuntu Karmic Koala distribution.
Webs Developer – http://www.websdeveloper.com/
What Is MapReduce And Its Benefits by REDSUEDERED.COM
By Gregory Smith
MapReduce is made of two parts. The first part is the Map. Basically, this is the part that locates the data and “maps” them into different clusters. This means that Map is the first line that will identify the preliminary information
REDSUEDERED.COM – http://www.redsuedered.com/

Google Web Alert for: mapreduce

Slashgeo | USPTO Grants Google a Patent On MapReduce
Found on slashdot, here is their summary : "Two years ago, David DeWitt and Michael Stonebraker deemed MapReduce a major step backwards (here are the
