Posts tagged Project
1000 Genomes Project and AWS: A Startup Opportunity?
Apr 4th
Last week, Amazon welcomed the 1000 Genomes Project data to Amazon S3 as part of the Obama administration’s big data initiative (PDF). The initiative is far-reaching and likely to have an impact on a number of businesses. Focusing just on the 1000 Genomes Project, though, I wonder if this might be an opportunity for startups to provide tools or services around the data.
The 1000 Genomes Project is attempting to build “a comprehensive resource on human genetic variation” to “find most of the genetic variations that exist in people” (PDF) by studying DNA collected “from many people whose ancestors were from various parts of the world, and then putting all of this information in scientific databases on the Internet.”
Data Challenges
The 1000 Genomes Project may not be the biggest of big data projects, but it certainly fits the bill as big data. Despite the name, the project has actually collected the full genomic sequence from more than 1,700 people, and continues to add more samples. The donors are “mostly anonymous,” and each donor has consented to participate, if you were concerned about data privacy. Right now, the 1000 Genomes data clocks in at 200TB, which has been a bit of a challenge to distribute and a challenge for companies that work with the data to gather.
Dr. Brandon Colby, CEO and Medical Director of Existence Genetics, says his company has been working with the data for eight or nine months already. The size of the data, says Dr. Colby, has been a roadblock.
Dr. Colby says that the company works with the 1000 Genomes data as a baseline data set to test their tools for analyzing genomes. The clients are those who are “healthy, and looking to stay healthy.” They submit their DNA to find out if they may have markers that show risk factors for heart disease, prostate cancer, and so on. Existence Genetics provides a report to the client and their health care professional, which is used to help take steps to prevent disease.
The standard for genetic research for the past 30 years, says Dr. Colby, is DNA chips that hold about 5MB of data and “tens of thousands of data points.” That’s because older methods of genetic testing looked for only specific parts of the gene set and ignored everything else. The data from the 1000 Genomes Project – and other testing being done currently – samples the complete genome. That gives a file for each individual that can be between 5GB and 1TB. That presents “a lot of technical issues to get beyond,” says Dr. Colby.
Startup Opportunity?
Herein may be the opportunity for some enterprising data scientists, but Dr. Colby says that would be “very difficult to capitalize on.” The problem, he says, is that the 1000 Genomes Project is providing data in a format that is very new to geneticists. “Old-school geneticists” – which he describes as those who studied in the 90s or earlier – are used to DNA chips that contain a small subset of the information contained in the 1000 Genomes data.
There’s also the fact that this is a very niche market, Dr. Colby says. But he says that if you could find the right team that can produce the right tools for understanding the 1000 Genome data, and other data like it, it could be a good opportunity.
Donnie Berkholz, an analyst with RedMonk who focuses on big data, agrees that there’s an opportunity here. “I think the biggest opportunity lies in integrating this data with the other public datasets on AWS, as well as private, in-house data. Once you’re at this scale of data, simply moving it around becomes infeasible, so this announcement is a big deal because it puts all this data in a place where so much other data and computational power already exists.”
He also notes that while some of the academic research community is “cautious” about public clouds, “bioinformatics is bucking that trend. My expectation is that most of the researchers working with this scale of data are already using public clouds, because you simply can’t work effectively with this scale of data in most environments.”
So it could be that the 1000 Genomes data is the right data set, in the right place, just waiting for the right team.
View full post on ReadWriteWeb
How To Run Your PPC Accounts Like A Project
Feb 20th
Managing PPC accounts can be overwhelming. There is so much to-do, and no one ever has enough time. This leads most people to just make huge todo lists of items they either should be doing, or want to eventually do inside their account. The problem with to-do lists is that they are easy to ignore….
Please visit Search Engine Land for the full article.
View full post on Search Engine Land: News & Info About SEO, PPC, SEM, Search Engines & Search Marketing
Googleplex Project X: Google’s New Secret Hardware Testing Lab
Feb 13th
Reportedly, Google is in the process of completing over $120 million worth of renovations and projects for its Mountain View headquarters. The latest project, which includes building a “secret lab” to test new hardware and devices, is being headed up by Google co-founder Sergey Brin and has been named “Project X.” According to news reports, [...]
Follow SEJ on Twitter @sejournal
View full post on Search Engine Journal
SEO Project Manager – Bizcommunity.com
Jan 16th
|
SEO Project Manager
Bizcommunity.com We are seeking the skills of a self-starting SEO project manager to manage and build an SEO team with an aim to rank our websites for multiple highly competitive casino related keyword terms and phrases. You must have strong people and project … |
View full post on SEO – Google News
Google Earth Funds Sea Turtle Tracking Project and Game
Jan 14th
Through the use of the Google Earth API technology and a grant given to the sea turtle conservation network WIDECAST, Google is funding the tracking of a sea turtle named Jklynn as she follows an ancestral path to create nests across the Carribean…
View full post on Search Engine Watch – Latest
Infographic: The 25 Most Important Online Project Management Solutions
Jan 11th
GetApp.com, an independent marketplace for online business software has released an infographic comparing online project management software solutions to help businesses choose the right product. There are some interesting trends and data on the chart, including their age and size, their emphasis on social media, and whether they offer integration with Google, Intuit and Salesforce products and have their own API as well. There is also information about whether Android or iOS versions of each app are available.
It is a pretty nice collection of different pieces of information and useful if you are in the market for this kind of software.
View full post on ReadWriteWeb
Seo Taiji in the U.S. to Work on 20th Anniversary Project – Soompi
Jan 5th
|
Seo Taiji in the U.S. to Work on 20th Anniversary Project
Soompi Seo Taiji is set to stage a comeback this year, especially to celebrate his 20 th anniversary. The singer, one of the influential and respected figures in the music industry, is working on material for release to commemorate his milestone year in the … |
View full post on SEO – Google News
Google Launches “Schemer” Project & Activity Finder
Dec 14th
Google has launched a content-sharing tool that’s all about finding new things to do in the real world. This project lets users create, share, find, discuss, and track progress on “schemes” for goals, activities, and adventures.
TechCrunch first…
View full post on Search Engine Watch – Latest