Subscribe to our Newsletter

Featured Blog Posts (214)

New Books and Resources for DSC Members

We are in the process of writing and adding new material (compact eBooks) exclusively available to our members, and written in simple English, by world leading experts in AI, data science, and machine learning. We invite you to sign up here to not miss these free books. …

Continue

Added by Vincent Granville on February 4, 2020 at 8:00am — No Comments

Fast clustering algorithms for massive datasets

Here we discuss two potential algorithms that can perform clustering extremely fast, on big data sets, as well as the graphical representation of such complex clustering structures. By extremely fast, we mean a computational complexity of order O(n) and even faster such as O(n/log n). This is much faster than good Hierarchical Agglomerative Clustering…

Continue

Added by Vincent Granville on February 23, 2013 at 10:00pm — 4 Comments

Graph Theory: Six Degrees of Separation Problem

This famous statement -- the six degrees of separation -- claims that there is at most 6 degrees of separation between you and anyone else on Earth. Here we feature a simple algorithm that simulates how we are connected, and indeed confirms the claim. We also explain how it applies to web crawlers: Any web page is connected to any other web page by a path of 6 links at most.

The algorithm below is rudimentary and can be used for simulation purposes by any programmer: It does not even…

Continue

Added by Vincent Granville on October 24, 2017 at 11:30pm — No Comments

Big Data: 50 Fascinating and Free Data Sources for Data Visualization

Have you ever felt frustrated when try to look for some data on Google? Pages of relevant websites but none can fulfill your expectation? Have you ever felt that your articles are less persuasive without data support?

General Data 
Continue

Added by Paul Black on October 30, 2017 at 7:30pm — No Comments

Role Or Impact Of Methodology To Research On Big Data

In recent situation there is a huge requirement to handle extends that enable organizations to change their techniques all the more rapidly so as to suspect the market and opposition.

With new advances, organizations have gotten themselves tossed into circumstances…

Continue

Added by Ethan Millar on January 18, 2020 at 6:30am — No Comments

Cloud Digital Asset Management and its Benefits

The idea of the digital asset doesn’t actually correspond to the real-world assets like gold, property, etc. rather it is more of your digital content like graphics, photos, documents, etc.

There’s far too much data generated by businesses online, so, how does one manage their digital assets? In order to bring the importance of DAM…

Continue

Added by Priti Shetti on August 13, 2019 at 4:20am — No Comments

Scraping Nasdaq news using Python

Stock trading has one of the most complex and complicated dynamics in the present day world. In today’s time, multiple algorithms and researches have been produced to understand the complexity of the stocks trading. There is an increasing effort to understand the system dynamics of stock trading to predict the emergent behavior of the stock prices.

In order to predict stock prices adequately, one needs to…

Continue

Added by Sandra K on July 7, 2019 at 10:00pm — No Comments

Debunking Forbes Article about the Death of the Data Scientist

In a recent article (February 2019) published in Forkes (see here) it was argued that there will be no data science job titles by 2029. The author wrote that Automation is coming for many tasks data scientists perform, including machine learning.

I disagree. If you haven't automated most of your tasks yet, you are not…

Continue

Added by Vincent Granville on February 4, 2019 at 4:30pm — No Comments

Invitation to Join Data Science Central

Join the largest community of machine learning (ML), deep learning, AI, data science, business analytics, BI, operations research, mathematical and statistical professionals: Sign up here. If instead, you are only interested in receiving our newsletter, you can subscribe here. There is no…

Continue

Added by Vincent Granville on September 8, 2018 at 10:18am — No Comments

10 Best Big Data Analytics Courses Online

Continue

Added by Paul Black on July 12, 2018 at 4:00am — No Comments

Five Ways Your Business Is At Risk Of Data Loss

Data loss comes in all shapes and sizes, from an accidentally deleted document to a catastrophic server failure that wipes out a critical database. Modern businesses are absolutely reliant on data, and on the infrastructure they use to store, process, and access that data. The grim reality is that those systems and the people who use them are fallible. Most business owners are aware that backups are important, but few have a full disaster recovery plan in…

Continue

Added by Karl Zimmerman on May 30, 2018 at 12:13pm — No Comments

Black Hat Data Science

Black hat data science consists of techniques designed to fool existing algorithms (Google search, Amazon rankings, and so on), compromising or tampering with the metrics -- especially ratios -- that they rely on, without actually physically touching or altering data stored in their databases. It exploits flaws in these algorithms, and it also relies on reverse engineering, to achieve its goal. So black hat data science is different from traditional hacking,…

Continue

Added by Vincent Granville on May 23, 2018 at 9:00am — No Comments

Most Popular Text Analytics Tools and Algorithms

With text analytics, various burning questions around the ‘why’ and ‘what’ of a piece or group of content can be answered. Examples like social media chatter around brand can create a supremely spiraling impact (remember the post which showed a Kentucky man was violently removed from…

Continue

Added by Preetish Panda on May 10, 2018 at 12:00am — No Comments

Making the Most of Marketing Automation Software for Your Business

2016 Google trends data showed an all-time high in searches for “marketing automation”, reflecting a sustained interest in the term and the concept. Focused more on nurturing rather than selling, marketing automation is centered on content that is relevant, personalized, and in line with what customers want.

Marketing automation makes use of various web-based services and software to carry out, manage, as…

Continue

Added by Rohit Chavan on May 11, 2018 at 7:00am — No Comments

A Local Backup Can't Keep Your Business's Data Safe

Every business depends on data. For both the teen who mows lawns at the weekend and globe-spanning internet giants like Google, data is vital.

It might be addresses written on a piece of paper or petabytes of information stored on thousands of powerful servers. If it’s lost, the business loses money, time, and the respect of its customers.

Most business leaders…

Continue

Added by Jay Caissie on May 22, 2018 at 10:00am — No Comments

This Artist Turns the Forest Floor into Data Visualizations

We’re all familiar with data visualizations—word clouds, pie charts, pivot tables—but how does one put enquiries in paint?  Patty Haller, a landscape artist from Seattle WA, may have figured that out.  Her training in finance at University of Washington and traditional oil painting at Gage Academy of Art raised more questions than it answered.…

Continue

Added by Julia Cook on March 26, 2018 at 1:30pm — No Comments

10 Machine Learning Algorithms You Should Know in 2018

The word “Big data” prevailed in 2017, and it’s going to keep prevailing in the following years. In our previous post, I’ve introduced some concepts about big data, machine learning, and data mining (see post: Understanding Big data, Data mining, and Machine Learning in 5 Minutes). Now let's dig deeper into Machine Learning with a brief walk-through of some most commonly…

Continue

Added by Paul Black on March 24, 2018 at 2:30am — No Comments

80 Best Data Science Books That Are Worthy Reading

Data science is probably the most popular concept nowadays. I believe that many people are looking for an entrance to get inside the industry, and I just happened to read an article that lists some great data science books that may be helpful for you. So I concluded it in this article and I’ve also given the books brief introductions, so you can choose the ones you’d like to read. Some of the data science books you can find it online, and I've given out the links. But most of them I…

Continue

Added by Paul Black on March 28, 2018 at 6:30pm — No Comments

What proves that Apache Spark is better than Hadoop?

There are certain cases where Apache Spark surpasses Hadoop. In this article, our experts will share their reviews about the things that make Apache Spark a superior choice over Hadoop.

Apache Spark is lightning fast cluster computing tool used by developers and programmers. This tool is up to 100 times faster than Hadoop MapReduce since it features faster-in-memory data analytics processing power. It is a Big Data framework that is used as a general purpose data processing engine on…

Continue

Added by Joseph Macwan on March 6, 2018 at 12:30am — No Comments

3 Requirements for an Enterprise Data Lake

There have been many articles written and talks given over the last several years on abandoning the Enterprise Data Warehouse (EDW) in favor of an Enterprise Data Lake with some passionately promoting the idea and others just as passionately denying that this is achievable. In this article, I would like to take a more pragmatic approach to the case and try and lay down a process that enterprises should consider for a data management architecture.

The focus is on data lakes for…

Continue

Added by Shanti Subramanyam on February 26, 2018 at 11:00am — No Comments

On Data Science Central

© 2020   BigDataNews.com is a subsidiary of DataScienceCentral LLC and not affiliated with Systap   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service