A Data Science Central Community
We are in the process of writing and adding new material (compact eBooks) exclusively available to our members, and written in simple English, by world leading experts in AI, data science, and machine learning. We invite you to sign up here to not miss these free books. …Continue
Added by Vincent Granville on February 4, 2020 at 8:00am — No Comments
Here we discuss two potential algorithms that can perform clustering extremely fast, on big data sets, as well as the graphical representation of such complex clustering structures. By extremely fast, we mean a computational complexity of order O(n) and even faster such as O(n/log n). This is much faster than good Hierarchical Agglomerative Clustering…Continue
This famous statement -- the six degrees of separation -- claims that there is at most 6 degrees of separation between you and anyone else on Earth. Here we feature a simple algorithm that simulates how we are connected, and indeed confirms the claim. We also explain how it applies to web crawlers: Any web page is connected to any other web page by a path of 6 links at most.
The algorithm below is rudimentary and can be used for simulation purposes by any programmer: It does not even…Continue
Added by Vincent Granville on October 24, 2017 at 11:30pm — No Comments
Have you ever felt frustrated when try to look for some data on Google? Pages of relevant websites but none can fulfill your expectation? Have you ever felt that your articles are less persuasive without data support?
Added by Paul Black on October 30, 2017 at 7:30pm — No Comments
In recent situation there is a huge requirement to handle extends that enable organizations to change their techniques all the more rapidly so as to suspect the market and opposition.
With new advances, organizations have gotten themselves tossed into circumstances…Continue
Added by Ethan Millar on January 18, 2020 at 6:30am — No Comments
The idea of the digital asset doesn’t actually correspond to the real-world assets like gold, property, etc. rather it is more of your digital content like graphics, photos, documents, etc.
There’s far too much data generated by businesses online, so, how does one manage their digital assets? In order to bring the importance of DAM…Continue
Added by Priti Shetti on August 13, 2019 at 4:20am — No Comments
Stock trading has one of the most complex and complicated dynamics in the present day world. In today’s time, multiple algorithms and researches have been produced to understand the complexity of the stocks trading. There is an increasing effort to understand the system dynamics of stock trading to predict the emergent behavior of the stock prices.
In order to predict stock prices adequately, one needs to…Continue
Added by Sandra K on July 7, 2019 at 10:00pm — No Comments
In a recent article (February 2019) published in Forkes (see here) it was argued that there will be no data science job titles by 2029. The author wrote that Automation is coming for many tasks data scientists perform, including machine learning.
I disagree. If you haven't automated most of your tasks yet, you are not…Continue
Added by Vincent Granville on February 4, 2019 at 4:30pm — No Comments
Join the largest community of machine learning (ML), deep learning, AI, data science, business analytics, BI, operations research, mathematical and statistical professionals: Sign up here. If instead, you are only interested in receiving our newsletter, you can subscribe here. There is no…Continue
Added by Vincent Granville on September 8, 2018 at 10:18am — No Comments
Added by Paul Black on July 12, 2018 at 4:00am — No Comments
Data loss comes in all shapes and sizes, from an accidentally deleted document to a catastrophic server failure that wipes out a critical database. Modern businesses are absolutely reliant on data, and on the infrastructure they use to store, process, and access that data. The grim reality is that those systems and the people who use them are fallible. Most business owners are aware that backups are important, but few have a full disaster recovery plan in…Continue
Added by Karl Zimmerman on May 30, 2018 at 12:13pm — No Comments
Black hat data science consists of techniques designed to fool existing algorithms (Google search, Amazon rankings, and so on), compromising or tampering with the metrics -- especially ratios -- that they rely on, without actually physically touching or altering data stored in their databases. It exploits flaws in these algorithms, and it also relies on reverse engineering, to achieve its goal. So black hat data science is different from traditional hacking,…Continue
Added by Vincent Granville on May 23, 2018 at 9:00am — No Comments
With text analytics, various burning questions around the ‘why’ and ‘what’ of a piece or group of content can be answered. Examples like social media chatter around brand can create a supremely spiraling impact (remember the post which showed a Kentucky man was violently removed from…Continue
Added by Preetish Panda on May 10, 2018 at 12:00am — No Comments
2016 Google trends data showed an all-time high in searches for “marketing automation”, reflecting a sustained interest in the term and the concept. Focused more on nurturing rather than selling, marketing automation is centered on content that is relevant, personalized, and in line with what customers want.
Marketing automation makes use of various web-based services and software to carry out, manage, as…Continue
Added by Rohit Chavan on May 11, 2018 at 7:00am — No Comments
Every business depends on data. For both the teen who mows lawns at the weekend and globe-spanning internet giants like Google, data is vital.
It might be addresses written on a piece of paper or petabytes of information stored on thousands of powerful servers. If it’s lost, the business loses money, time, and the respect of its customers.
Most business leaders…Continue
Added by Jay Caissie on May 22, 2018 at 10:00am — No Comments
We’re all familiar with data visualizations—word clouds, pie charts, pivot tables—but how does one put enquiries in paint? Patty Haller, a landscape artist from Seattle WA, may have figured that out. Her training in finance at University of Washington and traditional oil painting at Gage Academy of Art raised more questions than it answered.…Continue
Added by Julia Cook on March 26, 2018 at 1:30pm — No Comments
The word “Big data” prevailed in 2017, and it’s going to keep prevailing in the following years. In our previous post, I’ve introduced some concepts about big data, machine learning, and data mining (see post: Understanding Big data, Data mining, and Machine Learning in 5 Minutes). Now let's dig deeper into Machine Learning with a brief walk-through of some most commonly…Continue
Added by Paul Black on March 24, 2018 at 2:30am — No Comments
Data science is probably the most popular concept nowadays. I believe that many people are looking for an entrance to get inside the industry, and I just happened to read an article that lists some great data science books that may be helpful for you. So I concluded it in this article and I’ve also given the books brief introductions, so you can choose the ones you’d like to read. Some of the data science books you can find it online, and I've given out the links. But most of them I…Continue
Added by Paul Black on March 28, 2018 at 6:30pm — No Comments
There are certain cases where Apache Spark surpasses Hadoop. In this article, our experts will share their reviews about the things that make Apache Spark a superior choice over Hadoop.
Apache Spark is lightning fast cluster computing tool used by developers and programmers. This tool is up to 100 times faster than Hadoop MapReduce since it features faster-in-memory data analytics processing power. It is a Big Data framework that is used as a general purpose data processing engine on…Continue
Added by Joseph Macwan on March 6, 2018 at 12:30am — No Comments
There have been many articles written and talks given over the last several years on abandoning the Enterprise Data Warehouse (EDW) in favor of an Enterprise Data Lake with some passionately promoting the idea and others just as passionately denying that this is achievable. In this article, I would like to take a more pragmatic approach to the case and try and lay down a process that enterprises should consider for a data management architecture.
The focus is on data lakes for…Continue
Added by Shanti Subramanyam on February 26, 2018 at 11:00am — No Comments