Subscribe to our Newsletter

All Blog Posts (257)

IoT Continues to Makeover Many Industries; Adoption of IoT to Increase in Emerging Nations: Internet of Things Market:

Over the past few years, the Internet of Things has given all physical things, a complete digital makeover. Today, a wide variety of things are being computerized and linked to networks, ranging from cars, watches, appliances, and clothes. Even farm fields and roadways are being upgraded and modified with digital sensors. This optimistic outlook has helped many companies and industries to redesign their operations.

Vodafone, recently signed a deal with Inmarsat, to use their satellite…

Continue

Added by Pragati P. on October 24, 2016 at 4:00am — No Comments

Top 30 Free Web Scraping Software

Web scraping (also termed web data extraction, screen scraping, or web harvesting) is a web technique of extracting data from the web, and turning unstructured data on the web  into structured data that can stored to your local computer or a database.

The web scraping technique is implemented by web scraping software tools. These tools interacts with websites in the same way as you do when using a…

Continue

Added by Paul Black on September 22, 2016 at 11:00pm — No Comments

Smart Business: automated sentiments analysis on top

The modern world seems really fast and dynamic with a multitude of new products being launched. Marketing agencies are making fortune by monitoring the markets and delivering reports on consumers’ opinions. For today, the feedback analysis is a separate area, let’s say a growing industry with an array of products and services. And the prices for those services are pretty exorbitant.

So, do vendors have a chance to cut down…

Continue

Added by Yana Yelina on August 12, 2016 at 12:00am — No Comments

Can Science Create a System to Win at Roulette?

Probability and physics are helping make even roulette seem ultimately predictable.

In his new book, The Perfect Bet: How Science and Math Are Taking the Luck Out of Gambling, Adam Kucharski details how trying to understand dice games led one mathematician to develop probability theory,…

Continue

Added by Edward Turner on July 19, 2016 at 4:30pm — No Comments

Data Wars: Dawn of the Yottabyte

Big Data is an accumulation of data that is too large and complex for processing by traditional database management tools.

-Merriam Webster

 

Yeah But, What Really Makes Big Data Big Data?  This question is as fundamental to data science as the chicken/egg question should be to researchers at KFC. But we’re not dealing with an A/B chicken model here.  It’s more elephant to the dark room or scaling it up, the nearest star to our galactic…

Continue

Added by Orion Stallard on July 8, 2016 at 12:54pm — No Comments

7 Tools to extract text from HTML document

I want to share an interesting article about data scaping that you might need in your business. The article below is mainly reprinted from here

Text in the HTML document is the content that placed between HTML tags like <a> </a> , <title> </title>. Sometimes we want to extract the text in the HTML document and there are two methods that can…

Continue

Added by Nora Choi on May 31, 2016 at 2:30am — No Comments

Hadoop Yarn explanation and container memory allocations

Yarn Resource manager (The Yarn service Master component)

1) Controls of the total resource capacity of the cluster

2) Whatever the container is needed in the cluster it sets the minimum container size that is controlled by yarn configuration property

àyarn.scheduler.minimum-allocation-mb 1024(This value changes based on cluster ram capacity)

Description: The minimum allocation for every container request at the RM, in MBs.…

Continue

Added by skumar T on May 30, 2016 at 8:00pm — No Comments

Data has always existed, the key is the right data

What does The Library of Alexandria, The Normans and a book have to do with data? I never thought about

The Library...

...at Alexandria was in charge of collecting all the world's knowledge, and most of the staff was occupied with the task of translating works onto papyrus paper... 1

Or The Normans and the...

Domesday Book (Latin: Liber de Wintonia "Book of…

Continue

Added by George Psistakis on May 20, 2016 at 5:20am — No Comments

Which Language is Better For Writing a Web Crawler? PHP, Python or Node.js?

I want to share with you a good article that might help you better extract web data for your business.

Yesterday, I saw someone asking “which programming language is better for writing a web crawler? PHP, Python or Node.js?”and mentioning some requirements as below.

 

  1. The analytic ability to web page
  2. Operational capability to database(MySQL)
  3. Efficiency of crawling
  4. The…
Continue

Added by Nora Choi on May 19, 2016 at 6:30pm — 3 Comments

Decisions not data - how the 'crowd' helped me

There is a lot of talk now about data, big data, unstructured data etc. However, if I might be so bold, talking data first is putting the cart before the horse. Data is only valuable if it’s useful. As a data strategist and analyst, the focus of my work is on decisions not data, and it's not just me whose focus is shifting. PwC last year did a report on Big Decisions and this year CapGemini wrote a great article summarising their research and stating the case for excellence in data driven… Continue

Added by Lucie Clayton on April 26, 2016 at 3:07am — No Comments

Data Lakes Still Need Governance Life Vests

As a central repository and processing engine, data lakes hold great promise for raising return on data assets (RDA).  Bringing analytics directly to different data in its native formats can accelerate time-to-value by providing data scientists and business users with increased flexibility and…

Continue

Added by Gabriel Lowy on April 11, 2016 at 12:00pm — No Comments

Key Challenges Big Data Shartups are Facing

Continue

Added by Igor Savula on April 5, 2016 at 1:30am — No Comments

The IoT User Experience Urgency

As we evolve toward a software-defined world, there’s a new user experience urgency emerging.  That’s because the definition of “user” is going to be vastly expanded.  In the Internet of Things (IoT) era, users include machines.…

Continue

Added by Gabriel Lowy on March 30, 2016 at 9:43am — No Comments

Microsoft & Acer's New Cloudbooks Ready to Take on Google Chromebooks

Microsoft and Acer make preparations to introduce new ‘Cloudbooks’ to take on their long time rival Google Chromebook. 

For past many years, Google has been in the news for steering the idea of cost – effective and web –connected personal computers, that ensures minimum offline capability. The IT giant took every possible care to make sure these PCs were well supported by suite of applications (online) courtesy of cloud environment. Idea of sleek – client PCs loaded with greater…

Continue

Added by Sarika Tadavi on March 30, 2016 at 12:00am — No Comments

Principal Component Analysis using R

Curse of Dimensionality:

One of the most commonly faced problems while dealing with data analytics problem such as recommendation engines, text analytics is high-dimensional and sparse data. At many times, we face a situation where we have a large set of features and fewer data points, or we have data with very high feature vectors. In such scenarios, fitting a model to the dataset, results in lower predictive power of the model. This scenario is often termed as…

Continue

Added by suresh kumar gorakala on February 28, 2016 at 9:30pm — No Comments

Data Lake Architecture Considerations & Composition

In our last blog we saw the key benefits of Data Lake, but let’s deep dive in to the internals of a Data Lake via discussing the key considerations and compositions.

Architecture Considerations

Take in any solution considerations it is practical difficult to arrives with a one-size-fit-all architecture; hence it applies for a Data Lake too. Hence the Data Lake architecture considerations…

Continue

Added by Kumar Chinnakali on February 13, 2016 at 10:42am — No Comments

Do You Really Need a Big Data Strategy?

With increasing frequency, CIOs are being asked by their senior management, “What’s our big data strategy?”  But do you really need a big data strategy?

In our view, companies should instead focus on data…

Continue

Added by Gabriel Lowy on January 26, 2016 at 11:48am — No Comments

Three Big Data Trends for 2016

Is your company poised to take advantage of three key trends in Big Data? Syncsort, a global leader in Big Data and mainframe software, recently released the results of its second annual Hadoop survey. Based on the survey results there are three areas that companies will focus on in 2016, to realize the full potential of Big Data analytics.

         First, Apache Spark will move from a talking point into deployment. Nearly 70 percent of survey respondents are interested in Apache…

Continue

Added by John McCure on January 22, 2016 at 4:00pm — No Comments

New tools in FlyElephant and announcement of webinars

FlyElephant team wishes a Happy New Year. We started this year with an expansion of tools' list, webinars and the formation of a community around the platform.

FlyElephant is a platform that provides scientists with computing infrastructure for calculation and automates routine tasks and allows focus on the core issues of research.

Octave and Scilab have now…

Continue

Added by Dmitry Spodarets on January 12, 2016 at 4:11am — No Comments

Learn Everything about Sentiment Analysis using R

Today I will explain you how to create a basic Movie review engine based on the tweets by people using R. The implementation of the Review Engine will be as follows:
  • Gets Tweets from Twitter
  • Clean the data
  • Create a Word Cloud
  • Create a data dictionary
  • Score each tweet.

Gets Tweets from Twitter:

First step is to fetch the data from Twitter. In R, we have facility to call the twitter API using package…
Continue

Added by suresh kumar gorakala on January 11, 2016 at 6:00am — No Comments

Blog Topics by Tags

Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

On Data Science Central

© 2020   BigDataNews.com is a subsidiary of DataScienceCentral LLC and not affiliated with Systap   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service