You are hereHistory of Grand Prize Team and Vandelay Industries !

History of Grand Prize Team and Vandelay Industries !



Grand Prize Team

Grand Prize Team, captained by Gabor Takacs of Gravity R&D, has been one of the leading teams in the Netflix Prize competition throughout 2009. At one point this spring, the slimmest of possible margins (0.01%) separated Grand Prize Team in third place from the two teams tied for the lead. Leading shareholders in Grand Prize Team (a.k.a. GPT) include Joe Sill, Ces Bertino and the members of the two teams who founded GPT, Gravity and Dinosaur Planet.

GPT was founded on the notion that collaboration was the key to victory in the Netflix Prize competition. Gravity and Dinosaur Planet had already shown what collaboration could accomplish. They had previously joined forces to form the team When Gravity and Dinosaurs Unite, rising to first place on the leaderboard on the day before the deadline for submissions for the first Progress Prize of the competition in October 2007. They were edged out by the AT&T team KorBell in the final hours, but the power of partnerships had been demonstrated in dramatic fashion.

Gravity is a team of four researchers from Hungary: Gabor Takacs (Szechenyi Istvan University, Gyor) and Istvan Pilaszy, Bottyan Nemeth, and Domonkos Tikk (Budapest University of Technology and Economics). They form not only a Netflix Prize team but also the core of a company, Gravity R & D. Gravity held the top spot on the leaderboard for several months during the first year of the competition. Dinosaur Planet, another leading Netflix Prize team during the first year, was formed by three students from the Princeton class of 2007: Lester Mackey, David Weiss, and David Lin. Mackey and Weiss have since moved on to computer science PhD programs at UC Berkeley and University of Pennsylvania, respectively, while Lin works in finance in New York.

Gravity and Dinosaur Planet decided in January 2009 to take the concept of collaboration to a new level by creating a team which issued a standing invitation to any other Netflix Prize competitor to submit techniques and results to be assessed for the potential to boost the score of the combined team. The deal offered to the rest of the competitors seemed eminently fair. At the time of formation, the union of the two founding teams had already achieved 9% of the 10% improvement required to win the Netflix Prize, but that final 1% was such a daunting challenge that the founders were willing to offer a two-thirds share of the prize ($666,666 USD) to additional collaborators. Shares would be granted in proportion to the size of the contribution each collaborator made towards that elusive 1%. Thus, an improvement in the leaderboard score of 0.0001, or just one basis point (0.01%), was likely to be worth nearly seven thousand dollars. Gabor Takacs was chosen to captain the new team, which was named Grand Prize Team out of a spirit of optimism.

Bottyan Nemeth of Gravity designed a server which could quickly yet rigorously analyze files produced by models from GPT applicants, searching for indications that the applicant's submission could improve the team's score by providing something complementary to the models the team had already developed. Anyone could upload submissions at any time of day or night and get a quick response from the server indicating whether the submission was promising. Applicants were also invited to send modeling software to GPT for evaluation for possible synergies. Submitting a result which could help GPT was a difficult task, given the high position on the leaderboard which the team already held. Many applicants were unable to demonstrate any ability to improve upon the set of models GPT had already developed.

Nonetheless, the "open invitation" strategy quickly paid off when Ces Bertino, a software engineer working in San Diego, submitted results to GPT. Bertino already had one of the best single-person teams in the entire competition and had held a top 10 position on the leaderboard for many months. Bertino's submissions provided an improvement of 21 basis points (0.21%) - an enormous jump in a competition where leading teams would rejoice when making improvements of just a few basis points.

Grand Prize Team then collected additional important contributions from Netflix Prize competitors from around the world, such as Wojtek Kulik, Bill Roberts and Willem Mestrom. Kulik is a predictive modeling researcher and entrepreneur in Warsaw, Poland. Roberts is a researcher in statistical signal processing and a part-time faculty member at George Washington University in Washington DC. Mestrom is a computational scientist and software engineer in the Netherlands.

Joe Sill, a machine learning PhD from Caltech with dotcom and finance experience, submitted technology in February which proved highly promising when evaluated by Takacs. After refining the software Sill submitted, Takacs found that it could boost GPT's score by 14 basis points, another major jump. Then Dan Nabutovsky, an algorithm designer from Israel, contributed a substantial improvement of 6 basis points, and the possibility that GPT might challenge for the top spot on the leaderboard began to look more likely. Sill continued to submit enhancements and accrued 8 more basis points of improvement, eked out a few basis points at a time. As a result, GPT rose to within just one basis point of the top position on the leaderboard in May, achieving a score of 0.8597 while Pragmatic Theory and BellKor in Big Chaos (two teams who would later merge to form BellKor's Pragmatic Chaos) were tied in first, each with a score of 0.8596. When BellKor's Pragmatic Chaos broke the million dollar barrier in late June, Grand Prize Team had a score of 0.8594, the best score of any team not participating in the BellKor's Pragmatic Chaos coalition.

After the breaking of the million dollar barrier, only 30 days remained for other teams to attempt to catch BellKor's Pragmatic Chaos. For the final push, GPT brought on board David Purdy, who is finishing a statistics PhD at UC Berkeley. Purdy contributed a number of analyses which had not yet been pursued by other GPT members and a wide-ranging perspective on the statistics literature.

In the closing days of the competition, GPT decided that it wasn't finished pursuing a cooperative approach to the Netflix Prize. Talks with another leading team, Opera Solutions and Vandelay United, led to the formation of The Ensemble.

Writing credits:

Joe Sill


Vandelay Industries !

On January 1st, 2009, dreamhost.com had a 95% off sale. You could purchase a two year web hosting plan that included shell access to a Linux server, unlimited users, and unlimited storage space for $20.00. Greg McAlpin (OfADifferentKind), a software developer in Houston, TX area, bought a two-year subscription with vague notions of setting up a website some day.

On February 25, 2009, Greg's Probe File Exchange website went online. It was an invitation-only website where members could upload their probe prediction files and see how their score in the Netflix Prize contest might improve if they were to combine their results. Probe prediction files are files that competitors could use to measure the effectiveness of their algorithms. Netflix supplied a suggested set of probe data. Since we were all using the data Netflix suggested, it was easy to compare our results.

Greg invited six people to join the Probe File Exchange. They were not chosen because they had the lowest scores. They were chosen because they were all active on the Netflix forum and their posts were consistently helpful, friendly, and funny. They were chosen because they are the sort of people that you want to work with. The Netflix forum ( http://www.netflixprize.com//community/) is the place where competitors could ask questions and help each other. There has been amazing openness in the forum. People have shared everything from ideas to source code. Five of the six people who were invited on February 25th are now members of The Ensemble.

On the first day that the Probe File Exchange was online, Bo Yang (Newman !) proposed to Greg that they create a new joint team. Bo and Greg went on to form the team "Newman and George !". They hoped that a submission of their combined files would have an RMSE lower than 0.8712 (the 2007 progress prize RMSE). RMSE, or Root Mean Squared Error, is a way of measuring the average error for a set of predictions. On February 27th, Newman and George ! made their first submission with an RMSE of 0.8689.

In order to share files, members created directories on the same Linux server that was hosting the website (on dreamhost.com). That original setup grew into the infrastructure that allowed Vandelay Industries ! to easily support many members.

Bill Bame (clueless) began uploading files the first day that the Probe File Exchange was online. He has always had extremely creative ideas and unique approaches. The files that he uploaded to the Probe File Exchange combined extremely well with those of Newman and George !. On February 26th, Bill was invited to a new team named "Newman, George, and Peterman !".

The Probe File Exchange had its own private forum where members could share ideas. Chris Hefele (chef-ele) posted some information about the non-linear ways that he used to combine files. The most common way for competitiors to combine files is linear regression. That's a mathematical way of taking many points and finding the line that passes nearest to all of the points. Nonlinear regression is much more complex. It attempts to find a curve that passes closest to all of the points. The results that Chris achieved were extremely impressive. On March 12, 2009, Chris was invited to join the team. He was going to be "Bania", but the team name was growing too long and the "Newman and ... !" teams were all on the front page of the leaderboard.

So a new team "Vandelay Industries !" was formed. The name "Vandelay Industries !" is of course a whimsical reference to "Seinfeld", as is our goal to become a coalition "for the rest of us" who are not at the top of the leaderboard. Chris continued to develop his blending techniques and he continued to produce amazing results. He is one of the main blenders on The Ensemble.

In March, George Tsagas of Feeds2 was invited to join Vandelay Industries !. He answered "not yet". He was already part of one of the leading teams and he said that there would be time to make collaborations when the leaders' improvement neared the 10% mark. Feeds2 is now a member of The Ensemble.

During March and the beginning of April, Vandelay Industries ! continued to make almost daily progress. In May, Bo made huge improvements in his personal score. With his improvements Vandelay Industries !, made up of four people working in their spare time, reached 15th place on the leaderboard among 5000+ teams.

Vandelay Industries ! was started by sending out emails to strangers asking if they wanted to work together. The team made contacts with other top teams and started dialogs with them. The person who gave the most help and encouragement was Larry Ya Luo (Dace). Larry/Dace is also the highest ranked single-member team on the Netflix Prize leaderboard. There was some disagreement about how Vandelay Industries ! should recruit new members. Some thought that we should contact teams lower than us on the leaderboard. They would be more likely to work with us. And we had already seen that a few people with no previous experience could achieve quite a bit by working together. There was hesitation about contacting the top teams on the leaderboard because Vandelay Industries ! really had nothing to offer them.

But the possibility that someone might turn us away has never deterred the team. We asked Larry if he would mind downloading our probe files and seeing how they mixed with his. He accepted, downloaded our files, and did significant analysis of them. Even though our files could barely improve his own, he offered suggestions for how we could make improvements and what he thought we needed to do to reach the top 10 on the leaderboard. Each time that Vandelay Industries made a significant improvement, Larry would look at our files and try to help us.

In June, Jeff Howbert (team Howbert) contacted Bo about combining efforts, and Jeff joined Vandelay Industries !. The team was preparing a new submission and was quietly confident that Vandelay Industries ! would get into the top 10 on the leaderboard for the first time. Then BellKor's Pragmatic Chaos made their submission that made a 10.05% improvement.

Immediately Vandelay Industries ! began sending emails to all of the top teams, inviting them to join or cooperate with Vandelay Industries !. Larry was one of the first to agree to join our team. Others followed. The infrastructure that we had in place made it simple for us to add more teams. People were able to quickly integrate into the team and become productive.

As the final moments of the competition approached on July 24th 2009, Greg McAlpin (OfADifferentKind) and Christopher Hefele (chef-ele) reflected on what the contest meant to them, the unique qualities Vandelay Industries ! offered The Ensemble, and what's next for the group.

"Joining with Grand Prize Team to create The Ensemble put us in the incredible position of making a 10% improvement over the Cinematch program that Netflix uses" said Greg Mcalpin. Larry has said it well: our goal was to make a 10% improvement. When we do that, we'll have finished successfully with a job well done. "A million dollars isn't why we've worked so hard", Greg says, "at the beginning of the contest, a lot of people said that it would be impossible for anyone to reach the 10% improvement. From February until now, in six months, this group has done the impossible." Greg continued: "if we come in second place or last place, it has been fun and it's been an awesome experience working with the great and brilliant people on this team."

"The merged team's name 'The Ensemble' not only refers to the large group of team members that's been merged together, but it's also a reference to "ensemble methods,'" says Chris/chef-ele, "which is the term researchers use for the techniques we're using to combine our individual predictions into a group prediction that is better than any of the individuals. "

"Next, although some of our teammates have formal backgrounds in machine learning, they're working side-by-side with many others who were drawn to this problem as an interesting hobby or puzzle", Chris/chef-ele says "it's like a data-miner's Rubik's Cube...very addictive."

"It's my opinion that the successes of this team is not only being driven by the technologies we're using to combine or data, but also by our ability to combine many people together & create a cohesive, functioning team in less than 30 days", Chris continued, "So our successes will be not only technological, but also organizational. It will be interesting to see if a large group of underdogs can defeat a small group of the leaders."

Writing credits:

Chris Hefele
Greg McAlpin
Susie Murphy (susie@loganmurphy.com)
Joe Sill

I have bookmarked your blog, the articles are way better than other similar blogs.. thanks for a great blog!
christian h. girlfriend activation system guide review

I am a new user of this site so here i saw multiple articles and posts posted by this site,I curious more interest in some of them hope you will give more information on this topics in your next articles. Glad to chat your blog, I seem to be forward to more reliable articles and I think we all wish to thank so many good articles, blog to share with us. rebelmouse.com

article marketing I found your this post while searching for some related information on blog search...Its a good post..keep posting and update the information.

What a good blog you have here. Please update it more often. This topics is my interest. Thank you. . . Housing News

It's a shame you don't have a donate button! I'd most
certainly donate to this brilliant blog! I suppose for now
i'll settle for bookmarking and adding your RSS feed to my Google account.
I look forward to new updates and will share this website with my Facebook group.
Chat soon!

I am always searching online for articles that can help me. There is obviously a lot to know about this. I think you made some good points in Features also. Keep working, great job! cheap bridal shower invitations

wow, great, I was wondering how to cure acne naturally. and found your site by google, learned a lot, now i’m a bit clear. I’ve bookmark your site and also add rss. keep us updated.
lpg conversions

Mmm.. good to be here in your article or post, whatever, I think I should also work hard for my own website like I see some good and updated working in your site. wet room drain

I am grateful for your hard work. But if you did it in a simple procedure that would be in reality polite. But over all I exceedingly not compulsory you and certain will stay for more posts like this landing page

"It's my opinion that the successes of this team is not only being driven by the technologies we're using to combine or data, but also by our abilityHousing Market News

Positive site, where did u come up with the information on this posting? I'm pleased I discovered it though, ill be checking back soon to find out what additional posts you include.
download the ejaculation trainer by matt gorden

According to the observations I that I say good honest information that you have posted this as well as your website. I am also very grateful for the posts that have been given with good writing and very useful for me. liga champion
liga champion
Beni Blog
champion
liga champion
liga champion
Backlink Super SEO
Beni Blog

Check out our visualization page, where there's a large "map" of movie similarities that you can explore interactively.192.168.0.1

seo So while we are confident the powerful company exhibitions that offer real value are here to remain, the second part of the query is “Why are some reveals like AHR Expo doing so well while others battle

Easily, the article is actually the best topic on this registry related issue. I fit in with your conclusions and will eagerly look forward to your next updates. Just saying thanks will not just be sufficient, for the fantasti c lucidity in your writing. I will instantly grab your rss feed to stay informed of any updates.
formazione viterbo

Once I thought about things like: why such information is for free here? Because when you write a book then at least on selling a book you get a percentage. Thank you and good luck on informing people more about it!
formazione viterbo

have read your excellent post. This is a great job. I have enjoyed reading your post first time. I want to say thanks for this post. Keep it up guys and I share more in future post. japanese sword

Very good post .Thank you and good luck on informing people more about it… increase reverbnation plays

The first several months of my site there were no comments; just give it time; now they come in like crazy every day! Thanks. Auto-entrepreneur web

Great post full of useful tips! My site is fairly new and I am also having a hard time getting my readers to leave comments. Analytics shows they are coming to the site but I have a feeling “nobody wants to be first”.
sistemi di gestione viterbo

Hey – great blog, just looking around some blogs, seems a really nice platform you are using. I’m currently using WordPress for a few of my blogs but looking to change one of them over to a platform similar to yours as a trial run. Anything in particular you would recommend about it? OC Housing News

Should there be another persuasive post you can share next time, I’ll be surely waiting for it. man made diamonds

Thank you for the good writeup. It actually
used to be a amusement account it. Look advanced to far delivered agreeable from you!
By the way, how could we keep in touch?

Feel free to surf to my site - ALL KWs

Well his is my first time i visit here. I found so many entertaining stuff in your blog, especially its discussion. From the tons of comments on your articles, I guess I am not the only one having all the leisure here! Keep up the excellent work. Flight Delays Compensation

the articles are way better than other similar blogs.. thanks for a great blog!
best amino acids supplements

Many thanks on your uncover. I found your web site ideal for your would really like. Its content has great along with tips. We have now investigation a lot of them along with purchased a whole bunch off of their particular web site. promo staff in london

SEO MILANO Great post full of useful tips! My site is fairly new and I am also having a hard time getting my readers to leave comments. Analytics shows they are coming to the site but I have a feeling “nobody wants to be first”

I just want to let you know that I just check out your site and I find it very interesting and informative..
used cars

I just thought it may be an idea to post incase anyone else was having problems researching but I am a little unsure if I am allowed to put names and addresses on here. Formula 1

Once I thought about things like: why such information is for free here? Because when you write a book then at least on selling a book you get a percentage. Thank you and good luck on informing people more about it! sem

wow this good but ,I like your post and good pics may be any peoples not like because defrent mind all poeple , fly tickets

Your website is cool I am impressed by the information that you have on this site.
pizza cu piept de pui

Thanks for sharing the info, keep up the good work going.... I really enjoyed exploring your site. good resource...
how can i make a woman want me

wow, great, I was wondering how to cure acne naturally. and found your site by google, learned a lot, now i’m a bit clear. I’ve bookmark your site and also add rss. keep us updated. blog comments

Well Great post full of useful tips! My site is fairly new and I am also having a hard time getting my readers to leave comments. Analytics shows they are coming. Medical Negligence Claim

Thanks for your article on the traveling industry.
We would also like to include that if your senior taking into account traveling,
it really is absolutely vital that you buy traveling insurance
for older persons. When traveling, golden-agers are at biggest risk of experiencing
a medical emergency. Receiving the right insurance package for your age group can safeguard your health and provide you with peace of mind.

Well I’ve been following your blog for a while now and finally got the courage to go ahead and give you a shout out from Kingwood Texas! Just wanted to mention keep up the good work! Delayed Flight Compensation

This is my first time visit to your blog and I am very interested in the articles that you serve. Provide enough knowledge for me. Thank you for sharing useful and don't forget, keep sharing useful info.http://tokotaswanitabranded.com
tas lv terbaru

This blog is so nice to me. I will keep on coming here again and again. Visit my link as well .. HowMuchDoesanAppCost

This is my first time visit to your blog and I am very interested in the articles that you serve. Provide enough knowledge for me. Thank you for sharing useful and don't forget, keep sharing useful info. LedColourChangingOutdoorCubeChair

We are unquestionable it is possible to fashion as a result any kind of multifarious of huge all over the place definitely what exactly you have to have a good laugh. appleby dry lining box

it is possible to fashion as a result any kind of multifarious of huge all over the place definitely what exactly you have to have a good laugh.www.rebelmouse.com

There are many resources on the internet if you’re looking to make your own and tastes vary widely ​ciekawe czcionki

useful information on topics that plenty are interested on for this wonderful post.Admiring the time and effort you put into your b!.. pelletkachel groningen

So luck to come across your excellent blog. Your blog brings me a great deal of fun.. Good luck with the site. artist promotion services

They charge a setup fee of $200k and then charge $200k per year to run the system. They still sell 12 systems per year and customers rave about the cloud-based solutions and how it is benefitting their publishing business.pure garcinia cambogia extract

I just want to be able to enable you to realize This i simply just verify your current web page IN ADDITION TO my spouse and i find it very interesting IN ADDITION TO informative. fitness pal

Thanks intended for posting the particular info.We usually are truly grateful for your blog post. Tourism

Its a great pleasure reading your post.Its full of information I am looking for and I love to post a comment that "The content of your post is awesome" Great work espiar