, ,

Dataspace at A2 New Tech: Part II

We presented Golden Record, our cloud-hosted record matching technology at this month's A2 New Tech meeting. Here's a link!

What Do We Know About the Future of Staffing?

What do we know about the future of staffing? In short, not much. However there are hints that contract staffing could continue to grow as a critical strategic choice for many companies.

GOLDEN RECORD - Virtual Presentation at A2 New Tech 10/20!

Join us for our presentation on GOLDEN RECORD at the October Ann Arbor New Tech Meetup!

Surprised by AI: Volume II

Even robots are learning new skills these days! This issue of the Dataspace blog highlights some of the (maybe) surprising ways that AI is enriching our enrichment activities!

Resume Fraud: Are Grocery Stores Secretly Testing Your DNA?

Does a major grocery chain employ data scientists doing genetic testing? Or is this another case of resume fraud?

Will Covid Break Data Science?

Predictive analytics is about making educated guesses regarding the future based on things you know about the past. But, what if the future doesn't look anything like the past?

Not Enough College Football This Fall? Data Science Can Help

If you’re like me, these “unprecedented times” in the world of sports have opened up unprecedented hours of free time in your nights and weekends this fall. Data science can help!


As it turns out, this question (and variations of it) is one of the most frequently asked questions about data science that users search for via Google - as well as on other community platforms such as Reddit or Quora.


Much like an actual cloud that can look like whatever you imagine it to be, this virtual cloud is amorphous in its own right, and ever changing. Fortunately for those who want to leverage cloud computing to improve their data sharing, storage, and analytics capabilities it is less important to understand what "The Cloud" looks like than what it does. In this issue of our newsletter, we dive into some of the silver linings of cloud computing and its applications.

Resume Fraud: If you're going to copy from another resume, make sure that one makes sense!

Not to be dramatic but... if you haven't done the work, DO NOT PUT IT ON YOUR RESUME!
, , , ,


Much like an actual cloud that can look like whatever you imagine it to be, this virtual cloud is amorphous in its own right, and ever changing. Fortunately for those who want to leverage cloud computing to improve their data sharing, storage, and analytics capabilities it is less important to understand what "The Cloud" looks like than what it does. In this issue of our newsletter, we dive into some of the silver linings of cloud computing and its applications.
Bad Key

Email address is a really bad key!

At its heart, record matching is about uniqueness. We need to find what makes a record unique and then identify all the other records, in that data set and others, that share the properties that make it unique.
HP Hood Milk

Resume Fraud: Would This Company Use This Software?

For those who think there's a lot of fraud in resumes today...
Same but differentCreative Commons
, ,

Data Lake: Why record matching is critical to success

Today’s data lakes are like yesterday’s operational data stores (ODSs): Everyone has one and everyone means something different when they use the term. In my experience, a data lake is a place where analysts and data scientists can store and analyze data...

Be Assured, Data Science Ensures Success for Insurance

GREETINGS FROM DATASPACE! As we discussed in what may be our most viewed Dataspace blog post (ever!), insurance is one of the oldest data science based businesses - using math to make predictions about the future. In today's post, we’ll…
Purple Neutral Smiley

Why you need cloud neutrality

As you move your infrastructure to the cloud, it's important to consider how to not become beholden to a single cloud vendor. In other words, to attain cloud neutrality. I'm afraid that I don't have all the answers on how to do this, but I do…
, , ,

Danger, Will Robinson! (AI, Analytics and the Invisible Dangers)

While we may yet be a long way from the threat of truly sentient computers such as Hal 9000 and ARIIA, the current capability of AI technologies still offers plenty of power for those who wish to harness it for malicious purposes. These…

What's the Difference Between Matching & Persistent MDM?

The market offers matching solutions and the market offers MDM solutions. So, what's the difference between matching and master data management?

Technology, progress and ethics: a balancing act

Greetings from Dataspace! The last edition of our newsletter focused on some very creative applications of data science tools and methods. Most of these were lighthearted, merely reflecting the interest of the creators in their subject matter…
DuplicatesThe image is released free of copyrights under Creative Commons CC0.

Is it a Match or a Duplicate?

When trying to match similar records across data sets you'll run into two similar, but different, concepts: matching and deduplication...

Dataspace Newsletter: Your Chicken is Fried in Crude Oil - Data Science Proves It!

Greetings from Dataspace! In this issue of the Dataspace Newsletter we're taking a break from the more classical and businessy (aka boring) applications for machine learning and data science. Instead, we invite you to open the door to explore…
Dataspace's Golden Record

Data Privacy Compliance: Find people to keep them private

Today, the world is undergoing a massive event: the implementation of data privacy laws...

Dataspace Newsletter: More data, more problems (to solve)...

Greetings from Dataspace! Another week, another newsletter! Before we get into any new news, a quick update on a piece of old news: It looks like our new website was experiencing some problems of its own when we sent out our last newsletter…

Dataspace Newsletter: Grab your's time to update!

Greetings from Dataspace! Here's hoping that this week's newsletter finds you well, and that those of you who observe enjoyed Happy Passover and Easter celebrations, even if the family was only able to come together virtually. Much like…

Dataspace Newsletter: Flattening the (other) curve

Greetings from Dataspace! First of all, we here at Dataspace hope that you and all of your loved ones are safe and well in this time of uncertainty, and that we'll all soon be back on our feet. As the nation's workers struggle with unemployment…

Data science contractors – demand will increase in 2020

We’ve spent the past few years helping companies build their core data science functions by guiding them towards the best talent available in the market.  For many companies keen on gaining an edge in predictive technology, the dominant talent…

Data Science Salaries in Healthcare

How much should I pay for a data scientist? Being that data science is a relatively new field, we often get questions regarding compensation for various data science related skillsets across different locations and industries. Here are some…

Dataspace to sponsor 2019 Indy Big Data Conference in Indianapolis

Dataspace is thrilled to announce that we will be sponsoring the 2019 Indy Big Data Conference in Indianapolis, Indiana on September 19th!   We would love to meet any and all big data, data science and other analytics professionals…

A Big Mistake Data Science Recruiters Make

There’s a difference between a data scientist and a Python programmer who knows some data science frameworks   We’ve recently heard from a number of recruiters who’ve been burned. They were tasked with finding data scientists…
data science

Dataspace featured in Dice article about hiring better data scientists

Dataspace's Ben Taub was featured in a Dice post offering some clever suggestions for hiring excellent data science talent.  We encourage you to take a peek and absorb some of the wisdom within!…

Changing this one thing about your company can help you hire better data scientists

We begin many blog posts by highlighting the acute shortage of data scientists and the resultant difficulty in attracting them to your company.  Chances are that if you are reading this, you are well aware of that fact and may have already…

Dataspace to sponsor 2019 INFORMS Conference on Business Analytics & Operations Research

Dataspace is excited to announce that we will be sponsoring the 2019 INFORMS Conference on Business Analytics & Operations Research in Austin, Texas from April 14-16!   We encourage all local data science and business analytics…

What's The Difference Between Business Intelligence and Data Science?

Tools and techniques for management reporting and analysis have evolved since computers first came out. One can argue that the first management reporting tool was COBOL (COmmon Business Oriented Language). It allowed business people to get data…

Dataspace to sponsor 6th Annual Big Data & Business Analytics Summit at WSU

For the second year in a row, Dataspace is pleased to announce that we will be sponsoring the 6th Annual Big Data & Business Analytics Summit at Wayne State University in Detroit, MI from March 21st to 22nd, 2019! If you are a local…

Making your company more appealing to data scientists

As many a frustrated recruiter or hiring manager can tell you - a good data scientist is hard to find, and even harder to hire. Quality candidates frequently end up with multiple final round interviews (often within days of each other) leading…

Resume fraud: Warning signs and how to protect yourself

If you have experience in the tech staffing business, you may have noticed a recent rise in the amount of fraud and dishonesty in the marketplace. This is perhaps inevitable given the high level of demand for data engineering and data science…

What to look for in a data science resume

The process of screening candidates for any role can be both daunting and time consuming, and even more so when the role in question is a highly technical position such as a Data Scientist. The ever growing list of tools and technologies that…
contractors for data science

Step 1: Collect data. Step 3: Profit. Step 2: Hire data scientist?

Data science is, and has been, in vogue.  Every forward-thinking company wants to have a data science program because by now it is conventionally understood that it will improve profitability and efficiency across the business.   To…
contractors for data science

Why you should consider contractors for data science

If you’re like many of the analytics leaders and HR professionals we talk to, you know how hard it is to build a strong data science team. Given that everyone is chasing a few fish in a small pool, it’s hard to find strong candidates with…

Should we be more open to sponsoring analytics talent?

Trends seen throughout the majority of 2017 persist in the current job market - low unemployment rates and rapid job growth across multiple sectors. These factors have combined to create a highly competitive hiring environment and a shortage…

In Defense of Relational Databases

As you may know, many big data technologies are defined as schema on read. What this means is that you can throw whatever you want on the disk and then, when you need the data, you tell the data store what that data means (e.g. the second column…

The Shifting Role of IT in Analytics

In the distant past (circa 2015), IT was responsible for providing data consumers with data, tools, and development expertise. Today, the landscape is shifting, especially in larger organizations. Nowadays, IT provides some of the data - the…

Dataspace to sponsor 5th Annual Big Data & Business Analytics Symposium at WSU

Dataspace is happy to announce that we will be sponsoring the 5th Annual Big Data & Business Analytics Symposium at Wayne State University in Detroit, MI from March 22nd to 23rd, 2018!   If you plan to attend, please stop by our…

What does it mean to hash data and do I really care?

Hashing is simply passing some data through a formula that produces a result, called a hash. That hash is usually a string of characters and the hashes generated by a formula are always the same length, regardless of how much data you feed into it.

Data Science and Predictive Analytics Explained in Two Sentences

In data science and predictive analytics we determine the likelihood of something by looking at data about it. We do this simply by looking for similarities between that data and data from past cases where we actually know the outcome.

Dataspace to sponsor Southeast SAS Users Group 2017

We are proud to announce that Dataspace is a corporate sponsor of the 25th Annual Southeast SAS Users Group (SESUG 2017) conference in Cary, NC from November 5th to 7th, 2017! Kiran Venna, our featured conference speaker, will be presenting…

Kafka: What Is it?

Number One in Dataspace’s Data Science Series: What does it mean?   If nothing else, the data science industry is good at coming up with new, unique, confusing names and terms.  ZooKeeper, MapReduce, Hadoop, Pig, Storm, Mahout MongoDB…the…
artificial intelligence insurance

How Artificial Intelligence is changing the insurance industry

Any serious conversation about the future of data and analytics invariably turns to the topic of artificial intelligence.  The past year has seen AI surge in popularity, with high profile corporations and personalities getting behind what many…
Pitching data science budget

Pitching Your 2018 Data Science Budget

Last week we offered some suggestions on how to attack your data management initiatives in 2018 according to your organization’s level of data expertise.  This week we follow up with some tips on how to pitch those new data science technologies…
data science budget

Planning Your 2018 Data Science Budget? Here Are Some Logical Projects

It’s that time of year again!  With budgeting for 2018 at the forefront of your agenda, you may be wondering where to head with your data science efforts.  Investments in big data are often expensive, but when planned correctly you can manage…

How Big Data Helps Detect Hacking

Cyberattacks have become increasingly damaging and visible in recent years, in part because of numerous, high-profile instances of hacking affecting everything from your personal files to global election outcomes.   And, in fact, hacking…
facial recognition

Biometrics and Big Data: Facial Recognition

A key feature of big data is its lack of structure - we’re talking about the stuff that doesn’t fit neatly into excel columns or that is easily described on first glance using numbers or other descriptors.  This includes things such as…

The Difference Between an Actuary and a Data Scientist

The hot new thing, data science, isn’t so new after all. Since the advent of modern actuarial science in the late 1980s, insurance companies have relied on actuaries to use math and statistics to anticipate the future.

Do you need real-time business analytics? Start by asking yourself these 4 questions

If you have huge volumes of data, chances are that you can get value out of analyzing that data. But, would you derive significant value from analyzing it in real time? This is an important question because, while there are benefits to real…

So, what is the difference between a programmer and a data scientist?

There exists a chronic confusion as to what the distinction is between your average software engineer (i.e. programmer) and a data scientist.  This is totally understandable, considering the fact that both jobs do involve programming and the term “data science” seems so much like the term “computer science”.  However, the two differ in some significant ways.  

Are you taking the right approach to big data staffing?

The big data and predictive analytics landscape continues to shift rapidly as old practices are phased out and new technologies enter the mainstream on a virtually constant basis.  As these changes continue to play out, business and IT managers…
it department

The Sad Future of the Corporate IT Department

People in the tech world love to speculate on the future structure of the corporate IT department. While we cannot know precisely how this change will manifest, there are undeniable trends towards a more engaged and decentralized IT presence in organizations. This may very likely end in the elimination of the IT department as we know it.

Two Birds With One Stone: Learn Data Science and A Foreign Language at the Same Time!

We here at Dataspace thought it would be helpful to share some intriguing examples of how data can be easily manipulated to bring efficiency and value into our day-to-day lives. This week we will focus on how the daunting and time-consuming task of learning a foreign language can be made easier by taking a data-driven approach.
big data

Is my Big Data giving me Big Value?

Companies often find themselves wondering whether or not they have big data and questioning whether they even need business intelligence and predictive analytics to improve their business strategy. The reality is that you probably have more data than you think you do, but that the systems are not in place to make it useful.

Driving value through end-user engagement

It is important to remain aware of the fact that making data work for you is as much about people as it is about technology. Dataspace has learned a lot about end-user engagement with predictive software and business intelligence infrastructures over the past two decades. After all, it is not the products but the user of these products, the guy or gal who makes the day-to-day decisions, which ultimately determines whether or not your organization uses data effectively.
data driven

Becoming a Data-Driven Company in 2017

By now, we all know that big data and predictive modeling are at the heart of any successful and competitive corporate strategy.  But making data-driven decisions means more than just selecting software or implementing a BI system; it means…
terrified child plays computer

Business Intelligence Training: What to Study Over Lunch in 2017

If you’re like me, you’re an exceedingly attractive, 50-something data geek with a love for aviation history who’s an amazing ice hockey player. You also spend a lot of lunches eating a sandwich while studying new technologies. This…

If You Get This BI Requirement, Run!

The Lamest BI Requirement – And it Won’t Go Away!   Want a sure path to failure in BI and data warehousing? Don’t solicit requirements from your users.   Want a second path? Settle for the requirement, “Just give me everything…
data topology

A Big Thing I’ve Learned Assessing BI Systems

Assessing BI & DW Systems   A decent (and fun!) chunk of our business is assessing organizations’ business intelligence and data warehousing capabilities. In assessments we meet with business executives and IT staff to discuss…

Do You Really Have Data Management Principles?

Looking back on the work we’ve done over the past few years I’ve come across an interesting point: while companies may have mature data management organizations (DMOs), few of these DMOs have a set of principles behind their data management…
Declining Business Sales

A Dumb Thing that 99 out of 100 Data Warehousers do

Is your terminology causing your BI / DW team to head in the wrong direction?   As a business intelligence and data warehousing consultant I constantly work with IT teams that, even internally, can’t agree on whether they have a…

Programming: Get it into School!

For those who’ve been wondering why schools don’t teach at least introductory programming, a large group of industry and government leaders have drafted and signed a petition asking congress to provide funding for this.   While I…
Ambition Concept

When Business Moves Faster than BI

When Business Moves Faster than BI   Face the Facts – IT Does Interfere With Business Agility   It’s a rare organization where the business staff doesn’t complain that IT is holding them back. This is, in fact, one of the key…
do i have big data

Big Data: What to do if You Suspect You Have It

Do I Have Big Data?   If there is one question we hear from almost every Dataspace client, it is, “Do I have big data?” Many clients are torn about the prospect of big data. On the one hand, having big data represents a sort of badge…
data warehouse

Death of the Data Warehouse

I build data warehouses. I understand why they’re important, I make a living from them. I also see that traditional, relational data warehouses are on the way out. Their demise is coming from a few technological advances. One of these is…
data driven

More Thoughts on Becoming Data Driven

A few weeks ago I published a post on how to become a data driven company.  That post spoke to what managers can do to embed BI in their organizations.  Today, a different perspective on what user departments can do to become more data driven.   Users…

NPrinting: A Cost-Effective Way to Publish Your Qlikview Analyses

Challenge Here at Dataspace we’ve worked with Qlikview for a number of years.  It’s a great tool but it has one, glaring limitation: Qlikview does not offer an inexpensive, native method to publish and distribute reports to users.  In…

DataHero – Visualization for the Non-technical

I am consistently impressed by the number and quality of new data tools I’m seeing online. In the past, if you did not have access to an enterprise-level tool, your charting and analysis was done in Excel (and even if you did have an enterprise-level…

BI Predictions for 2015: 2 of 4

While different people will point to different times for the birth of business intelligence, I’ve always argued that BI started with the invention of writing.  Writing was invented, in part, as an accounting device.  How do I know how many…

BI Predictions for 2015: 3 of 4

DW Appliances Move from Strategic Options to Tactical Ones   As noted earlier, big companies are experimenting with Hadoop and will eventually lean on it heavily.  Proponents usually describe two key benefits of Hadoop: its ability to…

BI Predictions for 2015: 4 of 4

Cloud BI Opens the Field to Smaller & Smaller Organizations   Over the past year or two the number of software as a service business intelligence tools has exploded.  Some, like Microstrategy Cloud and Tableau Online, are offered…

BI Predictions for 2015: 1 of 4

While different people will point to different times for the birth of business intelligence, I’ve always argued that BI started with the invention of writing.  Writing was invented, in part, as an accounting device.  How do I know how many…

The Business Intelligence Sandbox

A few weeks ago I wrote about the idea that there are really only two uses for BI: analysis and business process improvement.  Today, let’s focus a bit more on analytic users / analysts.  In particular, let’s consider the concept of the…

You Can Only do Two Things With BI

I get a kick out of how vendors tout Business Intelligence. It’s usually something like, “Get more out of your data” OR “Don’t you want to be data driven?” OR “All the really cool companies are doing it, shouldn’t you?”  The…

Signs you need BI – #1: People are overusing MS Excel and Access

Are you wondering if your organization is at the point where it needs to improve its reporting capabilities and invest in a business intelligence system?  This note is the first in an occasional series of emails and blog posts that describe…

BI: On the State of Relations Between IT and End Users

Over the years, we have learned a whole lot about what it means to be a business intelligence end user, and we’d like to share some of what we’ve learned. While much of what we read about data warehousing and business intelligence is focused…

Confusing Terminology Part 1 – Business Intelligence

I remember watching a really good webinar one time entitled Ten Changes to Maximize the Impact of Your BI Strategy by Gartner analyst Kurt Schlegel.  I found it to be well worth the hour spent on it (perhaps because it confirmed many of my…

Business Intelligence Radar: Pearl Harbor and the Battle of Britain

What is Business Intelligence?   It’s the set of tools, abilities, and data required to create a picture of the world and how your organization operates in that world. In building your business intelligence capabilities, you can learn…

Star Schema: Data Mart? Yes – Data Warehouse? No

The Two Roles of a Data Warehouse   Most people think of data warehouses as databases that solve reporting problems. However, it’s more useful to think of them as addressing two sets of problems: 1) Reporting, or data distribution, problems…

Infinite MIPS, Or How Your Hardware Vendor Let you Down

The Concept of Data Warehousing is Fundamentally Flawed   Ever step back, think about what you’re doing, and then ask yourself, “Why?” Ever ask the same question about the concept of data warehousing? Its time to face facts – while…