A Data Science Central Community
Join the largest community of machine learning (ML), deep learning, AI, data science, business analytics, BI, operations research, mathematical and statistical professionals: Sign up here. If instead, you are only interested in receiving our newsletter, you can subscribe here. There is no…Continue
Added by Vincent Granville on September 8, 2018 at 10:18am — No Comments
Added by Paul Black on July 12, 2018 at 4:00am — No Comments
Data loss comes in all shapes and sizes, from an accidentally deleted document to a catastrophic server failure that wipes out a critical database. Modern businesses are absolutely reliant on data, and on the infrastructure they use to store, process, and access that data. The grim reality is that those systems and the people who use them are fallible. Most business owners are aware that backups are important, but few have a full disaster recovery plan in…Continue
Added by Karl Zimmerman on May 30, 2018 at 12:13pm — No Comments
Black hat data science consists of techniques designed to fool existing algorithms (Google search, Amazon rankings, and so on), compromising or tampering with the metrics -- especially ratios -- that they rely on, without actually physically touching or altering data stored in their databases. It exploits flaws in these algorithms, and it also relies on reverse engineering, to achieve its goal. So black hat data science is different from traditional hacking,…Continue
Added by Vincent Granville on May 23, 2018 at 9:00am — No Comments
With text analytics, various burning questions around the ‘why’ and ‘what’ of a piece or group of content can be answered. Examples like social media chatter around brand can create a supremely spiraling impact…Continue
Added by Preetish Panda on May 10, 2018 at 12:00am — No Comments
2016 Google trends data showed an all-time high in searches for “marketing automation”, reflecting a sustained interest in the term and the concept. Focused more on nurturing rather than selling, marketing automation is centered on content that is relevant, personalized, and in line with what customers want.
Marketing automation makes use of various web-based services and software to carry out, manage, as…Continue
Added by Rohit Chavan on May 11, 2018 at 7:00am — No Comments
Every business depends on data. For both the teen who mows lawns at the weekend and globe-spanning internet giants like Google, data is vital.
It might be addresses written on a piece of paper or petabytes of information stored on thousands of powerful servers. If it’s lost, the business loses money, time, and the respect of its customers.
Most business leaders…Continue
Added by Jay Caissie on May 22, 2018 at 10:00am — No Comments
We’re all familiar with data visualizations—word clouds, pie charts, pivot tables—but how does one put enquiries in paint? Patty Haller, a landscape artist from Seattle WA, may have figured that out. Her training in finance at University of Washington and traditional oil painting at Gage Academy of Art raised more questions than it answered.…Continue
Added by Julia Cook on March 26, 2018 at 1:30pm — No Comments
The word “Big data” prevailed in 2017, and it’s going to keep prevailing in the following years. In our previous post, I’ve introduced some concepts about big data, machine learning, and data mining (see post: Understanding Big data, Data mining, and Machine Learning in 5 Minutes). Now let's dig deeper into Machine Learning with a brief walk-through of some most commonly…Continue
Added by Paul Black on March 24, 2018 at 2:30am — No Comments
Data science is probably the most popular concept nowadays. I believe that many people are looking for an entrance to get inside the industry, and I just happened to read an article that lists some great data science books that may be helpful for you. So I concluded it in this article and I’ve also given the books brief introductions, so you can choose the ones you’d like to read. Some of the data science books you can find it online, and I've given out the links. But most of them I…Continue
Added by Paul Black on March 28, 2018 at 6:30pm — No Comments
There are certain cases where Apache Spark surpasses Hadoop. In this article, our experts will share their reviews about the things that make Apache Spark a superior choice over Hadoop.
Apache Spark is lightning fast cluster computing tool used by developers and programmers. This tool is up to 100 times faster than Hadoop MapReduce since it features faster-in-memory data analytics processing power. It is a Big Data framework that is used as a general purpose data processing engine on…Continue
Added by Joseph Macwan on March 6, 2018 at 12:30am — No Comments
There have been many articles written and talks given over the last several years on abandoning the Enterprise Data Warehouse (EDW) in favor of an Enterprise Data Lake with some passionately promoting the idea and others just as passionately denying that this is achievable. In this article, I would like to take a more pragmatic approach to the case and try and lay down a process that enterprises should consider for a data management architecture.
The focus is on data lakes for…Continue
Added by Shanti Subramanyam on February 26, 2018 at 11:00am — No Comments
Talent recruitment has always been a problem for companies in the technology sphere, especially in expanding markets where business growth and technological innovation generate intense competition. Over the next couple of years, recruitment will get even harder. GDP growth is at about 3 percent and it’s expected to remain at…Continue
Added by Dean Madison on March 1, 2018 at 3:30pm — No Comments
These predictions for 2018 are from Infologix.
“Metadata management and ensuring data privacy for regulations such as GDPR joins earlier trends like AI and IoT, but the unexpected trend of 2018 will be the convergence of data management technologies,” said Emily Washington, senior vice president of product management at Infogix. “Big data has been the next big technology phenomenon for a long time, but businesses are increasingly evaluating ways to…Continue
Added by Vincent Granville on December 30, 2017 at 10:59am — No Comments
Technology always takes a dominant position in economy and society. Millions of people therefore found their careers, and many others have even dived into a completely different field just for entering this industry. Even so, enterprises are still trying hard to seek for skilled programmers; when the right one shows, companies would even raise HR budgets. Technology is continuing to infiltrate into new platforms and industries, hence, to maximize one’s profit potentials, also for…Continue
Added by Paul Black on December 6, 2017 at 7:30pm — No Comments
The full membership includes, in addition to the newsletter…Continue
Added by Vincent Granville on November 29, 2017 at 11:13am — No Comments
Have you ever felt frustrated when try to look for some data on Google? Pages of relevant websites but none can fulfill your expectation? Have you ever felt that your articles are less persuasive without data support?
Added by Paul Black on October 30, 2017 at 7:30pm — No Comments
This famous statement -- the six degrees of separation -- claims that there is at most 6 degrees of separation between you and anyone else on Earth. Here we feature a simple algorithm that simulates how we are connected, and indeed confirms the claim. We also explain how it applies to web crawlers: Any web page is connected to any other web page by a path of 6 links at most.
The algorithm below is rudimentary and can be used for simulation purposes by any programmer: It does not even…Continue
Indexing is commonly used among programmers. Without fully grasping the idea behind the technique, a programmer is always eager to take advantage of it whenever they encounter a query performance problem, only to get disappointed by the result on many occasions. By analyzing the principle of indexing, the article tries to show programmers when is the appropriate time to use an index and how to use it.
The purpose of indexing is to quickly find…Continue
Added by JIANG Buxing on August 29, 2017 at 12:30am — No Comments
By JIANG Buxing
In the previous article, we discussed the necessity of the existence of a computing layer in the reporting architecture. Reporting tools support the user-defined interface-based programming with its host language (i.e. the programming language used for developing a reporting tool) to achieve the functionality of a computing layer for implementing complex computational logics, but the strategy reveals some real-life problems. An explicit data computing layer…Continue
Added by JIANG Buxing on August 24, 2017 at 10:30pm — No Comments