A Data Science Central Community
Over the past few years, the Internet of Things has given physical things a complete digital makeover. Today, a wide variety of things are being computerized and linked to networks, from cars and watches to appliances and clothes. Even farm fields and roadways are being upgraded with digital sensors. This optimistic outlook has helped many companies and industries redesign their operations.
Vodafone recently signed a deal with Inmarsat to use their satellite…Continue
Added by Pragati P. on October 24, 2016 at 4:00am — No Comments
Web scraping (also termed web data extraction, screen scraping, or web harvesting) is a technique for extracting data from the web and turning unstructured web data into structured data that can be stored on your local computer or in a database.
Web scraping is implemented by software tools. These tools interact with websites in the same way as you do when using a…Continue
Added by Paul Black on September 22, 2016 at 11:00pm — No Comments
The modern world seems fast and dynamic, with a multitude of new products being launched. Marketing agencies are making a fortune by monitoring markets and delivering reports on consumers’ opinions. Today, feedback analysis is a separate area, arguably a growing industry with an array of products and services, and the prices for those services are pretty exorbitant.
So, do vendors have a chance to cut down…Continue
Added by Yana Yelina on August 12, 2016 at 12:00am — No Comments
Probability and physics are helping make even roulette seem ultimately predictable.
In his new book, The Perfect Bet: How Science and Math Are Taking the Luck Out of Gambling, Adam Kucharski details how trying to understand dice games led one mathematician to develop probability theory,…Continue
Added by Edward Turner on July 19, 2016 at 4:30pm — No Comments
Big Data is an accumulation of data that is too large and complex for processing by traditional database management tools.
Yeah But, What Really Makes Big Data Big Data? This question is as fundamental to data science as the chicken/egg question should be to researchers at KFC. But we’re not dealing with an A/B chicken model here. It’s more elephant to the dark room or scaling it up, the nearest star to our galactic…Continue
Added by Orion Stallard on July 8, 2016 at 12:54pm — No Comments
I want to share an interesting article about data scraping that you might need in your business. The article below is mainly reprinted from here.
Text in an HTML document is the content placed between HTML tags such as <a> </a> and <title> </title>. Sometimes we want to extract the text in the HTML document and there are two methods that can…Continue
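To make the idea of "text between tags" concrete, here is a minimal sketch using Python's standard-library html.parser module. It is only one possible approach and is not necessarily either of the two methods the article goes on to describe. The class name TagTextExtractor, the helper extract_tag_text, and the sample HTML are hypothetical names chosen for illustration.

```python
from html.parser import HTMLParser


class TagTextExtractor(HTMLParser):
    """Collect the text found between the open and close of selected tags."""

    def __init__(self, wanted_tags):
        super().__init__()
        self.wanted_tags = set(wanted_tags)
        self._current = None   # wanted tag we are currently inside, if any
        self._chunks = []      # text pieces seen inside that tag
        self.results = []      # (tag, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        # Start buffering only when we enter a wanted tag.
        if tag in self.wanted_tags and self._current is None:
            self._current = tag
            self._chunks = []

    def handle_data(self, data):
        if self._current is not None:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        # Close the buffer when the matching end tag arrives.
        if tag == self._current:
            self.results.append((tag, "".join(self._chunks).strip()))
            self._current = None


def extract_tag_text(html, tags):
    """Return (tag, text) pairs for every occurrence of the given tags."""
    parser = TagTextExtractor(tags)
    parser.feed(html)
    parser.close()
    return parser.results


# Hypothetical sample document, used only for illustration.
SAMPLE_HTML = (
    "<html><head><title>Sample page</title></head>"
    '<body><a href="https://example.com">Example link</a></body></html>'
)

if __name__ == "__main__":
    for tag, text in extract_tag_text(SAMPLE_HTML, ["a", "title"]):
        print(tag, "->", text)
```

This sketch does not handle a wanted tag nested inside another wanted tag; real pages usually call for a dedicated scraping library or tool.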
Added by Nora Choi on May 31, 2016 at 2:30am — No Comments
YARN ResourceManager (the YARN service master component)
1) Controls the total resource capacity of the cluster
2) Sets the minimum size of any container requested in the cluster, which is controlled by the YARN configuration property
yarn.scheduler.minimum-allocation-mb 1024 (this value changes based on cluster RAM capacity)
Description: The minimum allocation for every container request at the RM, in MBs.…Continue
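As a rough illustration of what a minimum allocation means for a container request, the sketch below rounds a requested memory size up to a multiple of the minimum. The rounding rule is an assumption about scheduler behavior (it is how the CapacityScheduler is commonly described) and may differ by scheduler and Hadoop version. The function name normalize_request_mb is hypothetical and is not part of any YARN API.

```python
import math

# Value quoted above for yarn.scheduler.minimum-allocation-mb.
MIN_ALLOCATION_MB = 1024


def normalize_request_mb(requested_mb, min_allocation_mb=MIN_ALLOCATION_MB):
    """Round a memory request up to a multiple of the minimum allocation.

    Assumption: requests below the minimum are raised to the minimum, and
    larger requests are rounded up to the next multiple of it.
    """
    if requested_mb < 0:
        raise ValueError("requested_mb must not be negative")
    multiples = max(1, math.ceil(requested_mb / min_allocation_mb))
    return multiples * min_allocation_mb


if __name__ == "__main__":
    for request in (500, 1024, 1500):
        print(request, "MB requested ->", normalize_request_mb(request), "MB granted")
```

Under this assumption, a small minimum wastes less memory per container, while a large minimum reduces how many containers fit on a node.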
Added by skumar T on May 30, 2016 at 8:00pm — No Comments
What do the Library of Alexandria, the Normans and a book have to do with data? I never thought about
...at Alexandria was in charge of collecting all the world's knowledge, and most of the staff was occupied with the task of translating works onto papyrus paper... 1
Or The Normans and the...
Domesday Book (Latin: Liber de Wintonia "Book of…
Added by George Psistakis on May 20, 2016 at 5:20am — No Comments
I want to share with you a good article that might help you better extract web data for your business.
Yesterday, I saw someone asking “which programming language is better for writing a web crawler? PHP, Python or Node.js?” and mentioning some requirements as below.
Added by Lucie Clayton on April 26, 2016 at 3:07am — No Comments
As a central repository and processing engine, data lakes hold great promise for raising return on data assets (RDA). Bringing analytics directly to different data in its native formats can accelerate time-to-value by providing data scientists and business users with increased flexibility and…Continue
Added by Gabriel Lowy on April 11, 2016 at 12:00pm — No Comments
Added by Igor Savula on April 5, 2016 at 1:30am — No Comments
As we evolve toward a software-defined world, there’s a new user experience urgency emerging. That’s because the definition of “user” is going to be vastly expanded. In the Internet of Things (IoT) era, users include machines.…Continue
Added by Gabriel Lowy on March 30, 2016 at 9:43am — No Comments
Microsoft and Acer are preparing to introduce new ‘Cloudbooks’ to take on their longtime rival, the Google Chromebook.
For many years, Google has been in the news for championing the idea of cost-effective, web-connected personal computers with minimal offline capability. The IT giant took every possible care to make sure these PCs were well supported by a suite of online applications, courtesy of the cloud environment. The idea of sleek-client PCs loaded with greater…Continue
Added by Sarika Tadavi on March 30, 2016 at 12:00am — No Comments
Curse of Dimensionality:
One of the most common problems in data analytics tasks such as recommendation engines and text analytics is high-dimensional, sparse data. We often face a situation where we have a large set of features and few data points, or data with very high-dimensional feature vectors. In such scenarios, fitting a model to the dataset results in lower predictive power. This scenario is often termed as…Continue
Added by suresh kumar gorakala on February 28, 2016 at 9:30pm — No Comments
In our last blog we saw the key benefits of a Data Lake. Now let’s dive deep into the internals of a Data Lake by discussing the key considerations and compositions.
For any solution, it is practically difficult to arrive at a one-size-fits-all architecture, and the same applies to a Data Lake. Hence the Data Lake architecture considerations…Continue
Added by Kumar Chinnakali on February 13, 2016 at 10:42am — No Comments
With increasing frequency, CIOs are being asked by their senior management, “What’s our big data strategy?” But do you really need a big data strategy?
In our view, companies should instead focus on data…Continue
Added by Gabriel Lowy on January 26, 2016 at 11:48am — No Comments
Is your company poised to take advantage of three key trends in Big Data? Syncsort, a global leader in Big Data and mainframe software, recently released the results of its second annual Hadoop survey. Based on the survey results, there are three areas that companies will focus on in 2016 to realize the full potential of Big Data analytics.
First, Apache Spark will move from a talking point into deployment. Nearly 70 percent of survey respondents are interested in Apache…Continue
Added by John McCure on January 22, 2016 at 4:00pm — No Comments
The FlyElephant team wishes you a Happy New Year. We started this year with an expanded list of tools, webinars, and the formation of a community around the platform.
FlyElephant is a platform that provides scientists with computing infrastructure for calculations, automates routine tasks, and lets them focus on the core issues of research.
Octave and Scilab have now…Continue
Added by Dmitry Spodarets on January 12, 2016 at 4:11am — No Comments
Gets Tweets from Twitter:
Added by suresh kumar gorakala on January 11, 2016 at 6:00am — No Comments