LivingNightmare's Profile User Rating: -----

Reputation: 21 Tradesman
Group:
Contributors
Active Posts:
129 (0.09 per day)
Joined:
07-July 10
Profile Views:
4,210
Last Active:
User is offline Nov 13 2013 08:29 AM
Currently:
Offline

Previous Fields

Country:
CA
OS Preference:
Windows
Favorite Browser:
Internet Explorer
Favorite Processor:
Who Cares
Favorite Gaming Platform:
XBox
Your Car:
Who Cares
Dream Kudos:
25

Latest Visitors

Icon   LivingNightmare has not set their status

Posts I've Made

  1. In Topic: Processor Synchronisation

    Posted 9 Aug 2012

    I am not 100% sure what you're looking for, but I think it may be something like this: http://piccolo.news.cs.nyu.edu/ - I've used it before for processing large datasets in parallel on multiple nodes in a cluster. It's written in C++ and uses Google's MPI under the hood. The paper on the webpage would also give you a good idea of how Piccolo works, if you need. It also comes with a few examples as well (some in C++ and some in python). Some of the exampls include: Distributed Web Crawler, PageRank, and K-means Clustering. You can use piccolo to write code that performs a task in parallel. All of the load-balancing, syncronization and what not issues are handled under the hood for you. Hope it helps...
  2. In Topic: C++ Machine Learning Lib

    Posted 9 Aug 2012

    It's because the existing code base I have for the project is in C++, and that's the language I feel most comfortable with. Although I've already done a good chunk of coding in C++, what I wrote is mostly for pre-processing things, so I do have a little flexibility.

    Basically, the goal of this project is to try and cluster webpages based on a topic-scheme,
    and then partition these centroids on different nodes in order to improve search engine queries. Instead of having the entire cluster of nodes light up in order to perform a search, if we can determine the possible topics of the user's query, then only a few nodes would be required to light up :) - It's for a URA project I'm doing with one of the profs at my University. Since my original post, I have also come across this in Java: http://mallet.cs.umass.edu/topics.php, which seems pretty solid. The only problem is, this looks more of a supervised method, because I need to specify the number of topics (which I do not know ahead of time), along with a few other parameters.

    So I guess the actual clustering can be in any language, especially if it will work well for what I need it for (which is the unsupervised clustering/topic-seperation part).

My Information

Member Title:
D.I.C Head
Age:
23 years old
Birthday:
August 8, 1990
Gender:
Location:
Canada
Years Programming:
5
Programming Languages:
C/C++
Scheme

Contact Information

E-mail:
Click here to e-mail me
MSN:
MSN  living_nightmare99@hotmail.com

Friends

Comments

LivingNightmare has no profile comments yet. Why not say hello?