Blog

Technology, progress and ethics: a balancing act

Greetings from Dataspace! The last edition of our newsletter focused on some very creative applications of data science tools and methods. Most of these were lighthearted, merely reflecting the interest of the creators in their subject matter…
DuplicatesThe image is released free of copyrights under Creative Commons CC0.

Is it a Match or a Duplicate?

Is it a match or a duplicate? When trying to match similar records across data sets you'll run into two similar, but different, concepts: matching and deduplication. Quite simply, deduplicating means finding (technically, eliminating) what is…

Dataspace Newsletter: Your Chicken is Fried in Crude Oil - Data Science Proves It!

Greetings from Dataspace! In this issue of the Dataspace Newsletter we're taking a break from the more classical and businessy (aka boring) applications for machine learning and data science. Instead, we invite you to open the door to explore…
Dataspace's Golden Record

Data Privacy Compliance: Find people to keep them private

Today, the world is undergoing a massive event: the implementation of data privacy laws. (And you thought this was another post about coronavirus, didn't you?). Once the pandemic passes, data and legal professionals are going to have to return…

Dataspace Newsletter: More data, more problems (to solve)...

Greetings from Dataspace! Another week, another newsletter! Before we get into any new news, a quick update on a piece of old news: It looks like our new website was experiencing some problems of its own when we sent out our last newsletter…

Dataspace Newsletter: Grab your tools...it's time to update!

Greetings from Dataspace! Here's hoping that this week's newsletter finds you well, and that those of you who observe enjoyed Happy Passover and Easter celebrations, even if the family was only able to come together virtually. Much like…

Dataspace Newsletter: Flattening the (other) curve

Greetings from Dataspace! First of all, we here at Dataspace hope that you and all of your loved ones are safe and well in this time of uncertainty, and that we'll all soon be back on our feet. As the nation's workers struggle with unemployment…

Data science contractors – demand will increase in 2020

We’ve spent the past few years helping companies build their core data science functions by guiding them towards the best talent available in the market.  For many companies keen on gaining an edge in predictive technology, the dominant talent…

Data Science Salaries in Healthcare

How much should I pay for a data scientist? Being that data science is a relatively new field, we often get questions regarding compensation for various data science related skillsets across different locations and industries. Here are some…

Dataspace to sponsor 2019 Indy Big Data Conference in Indianapolis

Dataspace is thrilled to announce that we will be sponsoring the 2019 Indy Big Data Conference in Indianapolis, Indiana on September 19th!   We would love to meet any and all big data, data science and other analytics professionals…

A Big Mistake Data Science Recruiters Make

There’s a difference between a data scientist and a Python programmer who knows some data science frameworks   We’ve recently heard from a number of recruiters who’ve been burned. They were tasked with finding data scientists…
data science

Dataspace featured in Dice article about hiring better data scientists

Dataspace's Ben Taub was featured in a Dice post offering some clever suggestions for hiring excellent data science talent.  We encourage you to take a peek and absorb some of the wisdom within! https://insights.dice.com/employer-resource…

Changing this one thing about your company can help you hire better data scientists

We begin many blog posts by highlighting the acute shortage of data scientists and the resultant difficulty in attracting them to your company.  Chances are that if you are reading this, you are well aware of that fact and may have already…

Dataspace to sponsor 2019 INFORMS Conference on Business Analytics & Operations Research

Dataspace is excited to announce that we will be sponsoring the 2019 INFORMS Conference on Business Analytics & Operations Research in Austin, Texas from April 14-16!   We encourage all local data science and business analytics…

What's The Difference Between Business Intelligence and Data Science?

Tools and techniques for management reporting and analysis have evolved since computers first came out. One can argue that the first management reporting tool was COBOL (COmmon Business Oriented Language). It allowed business people to get data…

Dataspace to sponsor 6th Annual Big Data & Business Analytics Summit at WSU

For the second year in a row, Dataspace is pleased to announce that we will be sponsoring the 6th Annual Big Data & Business Analytics Summit at Wayne State University in Detroit, MI from March 21st to 22nd, 2019! If you are a local…

Making your company more appealing to data scientists

As many a frustrated recruiter or hiring manager can tell you - a good data scientist is hard to find, and even harder to hire. Quality candidates frequently end up with multiple final round interviews (often within days of each other) leading…

Resume fraud: Warning signs and how to protect yourself

If you have experience in the tech staffing business, you may have noticed a recent rise in the amount of fraud and dishonesty in the marketplace. This is perhaps inevitable given the high level of demand for data engineering and data science…
infinite

What to look for in a data science resume

The process of screening candidates for any role can be both daunting and time consuming, and even more so when the role in question is a highly technical position such as a Data Scientist. The ever growing list of tools and technologies that…
contractors for data science

Step 1: Collect data. Step 3: Profit. Step 2: Hire data scientist?

Data science is, and has been, in vogue.  Every forward-thinking company wants to have a data science program because by now it is conventionally understood that it will improve profitability and efficiency across the business.   To…
contractors for data science

Why you should consider contractors for data science

If you’re like many of the analytics leaders and HR professionals we talk to, you know how hard it is to build a strong data science team. Given that everyone is chasing a few fish in a small pool, it’s hard to find strong candidates with…
infinite

Should we be more open to sponsoring analytics talent?

Trends seen throughout the majority of 2017 persist in the current job market - low unemployment rates and rapid job growth across multiple sectors. These factors have combined to create a highly competitive hiring environment and a shortage…

In Defense of Relational Databases

As you may know, many big data technologies are defined as schema on read. What this means is that you can throw whatever you want on the disk and then, when you need the data, you tell the data store what that data means (e.g. the second column…

The Shifting Role of IT in Analytics

In the distant past (circa 2015), IT was responsible for providing data consumers with data, tools, and development expertise. Today, the landscape is shifting, especially in larger organizations. Nowadays, IT provides some of the data - the…

Dataspace to sponsor 5th Annual Big Data & Business Analytics Symposium at WSU

Dataspace is happy to announce that we will be sponsoring the 5th Annual Big Data & Business Analytics Symposium at Wayne State University in Detroit, MI from March 22nd to 23rd, 2018!   If you plan to attend, please stop by our…
hashing

What does it mean to hash data and do I really care?

What is Hashing?   Hashing is simply passing some data through a formula that produces a result, called a hash. That hash is usually a string of characters and the hashes generated by a formula are always the same length, regardless of…

Predictive Analytics explained in two sentences

Predictive analytics is this super-complex field that only statisticians and data scientists can understand, right? Well, perhaps it takes some training to do it well but it only takes two sentences to understand what it's all about: In predictive…

Dataspace to sponsor Southeast SAS Users Group 2017

We are proud to announce that Dataspace is a corporate sponsor of the 25th Annual Southeast SAS Users Group (SESUG 2017) conference in Cary, NC from November 5th to 7th, 2017! Kiran Venna, our featured conference speaker, will be presenting…
kafka

Kafka: What Is it?

Number One in Dataspace’s Data Science Series: What does it mean?   If nothing else, the data science industry is good at coming up with new, unique, confusing names and terms.  ZooKeeper, MapReduce, Hadoop, Pig, Storm, Mahout MongoDB…the…
artificial intelligence insurance

How Artificial Intelligence is changing the insurance industry

Any serious conversation about the future of data and analytics invariably turns to the topic of artificial intelligence.  The past year has seen AI surge in popularity, with high profile corporations and personalities getting behind what many…
Pitching data science budget

Pitching Your 2018 Data Science Budget

Last week we offered some suggestions on how to attack your data management initiatives in 2018 according to your organization’s level of data expertise.  This week we follow up with some tips on how to pitch those new data science technologies…
data science budget

Planning Your 2018 Data Science Budget? Here Are Some Logical Projects

It’s that time of year again!  With budgeting for 2018 at the forefront of your agenda, you may be wondering where to head with your data science efforts.  Investments in big data are often expensive, but when planned correctly you can manage…

How Big Data Helps Detect Hacking

Cyberattacks have become increasingly damaging and visible in recent years, in part because of numerous, high-profile instances of hacking affecting everything from your personal files to global election outcomes.   And, in fact, hacking…
facial recognition

Biometrics and Big Data: Facial Recognition

A key feature of big data is its lack of structure - we’re talking about the stuff that doesn’t fit neatly into excel columns or that is easily described on first glance using numbers or other descriptors.  This includes things such as…
actuary

The Difference Between an Actuary and a Data Scientist

The hot new thing, data science, isn’t so new after all. Since the advent of modern actuarial science in the late 1980s, insurance companies have relied on actuaries to use math and statistics to anticipate the future.   "A data…

Do you need real-time business analytics? Start by asking yourself these 4 questions

If you have huge volumes of data, chances are that you can get value out of analyzing that data. But, would you derive significant value from analyzing it in real time? This is an important question because, while there are benefits to real…

So, what is the difference between a programmer and a data scientist?

There exists a chronic confusion as to what the distinction is between your average software engineer (i.e. programmer) and a data scientist.  This is totally understandable, considering the fact that both jobs do involve programming and the…

Are you taking the right approach to big data staffing?

The big data and predictive analytics landscape continues to shift rapidly as old practices are phased out and new technologies enter the mainstream on a virtually constant basis.  As these changes continue to play out, business and IT managers…
it department

The Sad Future of the Corporate IT Department

People in the tech world love to speculate on the future structure of the corporate IT department. While we cannot know precisely how this change will manifest, there are undeniable trends towards a more engaged and decentralized IT presence in organizations. This may very likely end in the elimination of the IT department as we know it.

Two Birds With One Stone: Learn Data Science and A Foreign Language at the Same Time!

We here at Dataspace thought it would be helpful to share some intriguing examples of how data can be easily manipulated to bring efficiency and value into our day-to-day lives. This week we will focus on how the daunting and time-consuming task of learning a foreign language can be made easier by taking a data-driven approach.
big data

Is my Big Data giving me Big Value?

Companies often find themselves wondering whether or not they have big data and questioning whether they even need business intelligence and predictive analytics to improve their business strategy. The reality is that you probably have more data than you think you do, but that the systems are not in place to make it useful.

Driving value through end-user engagement

It is important to remain aware of the fact that making data work for you is as much about people as it is about technology. Dataspace has learned a lot about end-user engagement with predictive software and business intelligence infrastructures over the past two decades. After all, it is not the products but the user of these products, the guy or gal who makes the day-to-day decisions, which ultimately determines whether or not your organization uses data effectively.
data driven

Becoming a Data-Driven Company in 2017

By now, we all know that big data and predictive modeling are at the heart of any successful and competitive corporate strategy.  But making data-driven decisions means more than just selecting software or implementing a BI system; it means…
terrified child plays computer

Business Intelligence Training: What to Study Over Lunch in 2017

If you’re like me, you’re an exceedingly attractive, 50-something data geek with a love for aviation history who’s an amazing ice hockey player. You also spend a lot of lunches eating a sandwich while studying new technologies. This…
EVERYTHING MUST GO

If You Get This BI Requirement, Run!

The Lamest BI Requirement – And it Won’t Go Away!   Want a sure path to failure in BI and data warehousing? Don’t solicit requirements from your users.   Want a second path? Settle for the requirement, “Just give me everything…
data topology

A Big Thing I’ve Learned Assessing BI Systems

Assessing BI & DW Systems   A decent (and fun!) chunk of our business is assessing organizations’ business intelligence and data warehousing capabilities. In assessments we meet with business executives and IT staff to discuss…
constitution

Do You Really Have Data Management Principles?

Looking back on the work we’ve done over the past few years I’ve come across an interesting point: while companies may have mature data management organizations (DMOs), few of these DMOs have a set of principles behind their data management…
Declining Business Sales

A Dumb Thing that 99 out of 100 Data Warehousers do

Is your terminology causing your BI / DW team to head in the wrong direction?   As a business intelligence and data warehousing consultant I constantly work with IT teams that, even internally, can’t agree on whether they have a…
lehigh

Programming: Get it into School!

For those who’ve been wondering why schools don’t teach at least introductory programming, a large group of industry and government leaders have drafted and signed a petition asking congress to provide funding for this.   While I…
Ambition Concept

When Business Moves Faster than BI

When Business Moves Faster than BI   Face the Facts – IT Does Interfere With Business Agility   It’s a rare organization where the business staff doesn’t complain that IT is holding them back. This is, in fact, one of the key…
do i have big data

Big Data: What to do if You Suspect You Have It

Do I Have Big Data?   If there is one question we hear from almost every Dataspace client, it is, “Do I have big data?” Many clients are torn about the prospect of big data. On the one hand, having big data represents a sort of badge…
data warehouse

Death of the Data Warehouse

I build data warehouses. I understand why they’re important, I make a living from them. I also see that traditional, relational data warehouses are on the way out. Their demise is coming from a few technological advances. One of these is…
data driven

More Thoughts on Becoming Data Driven

A few weeks ago I published a post on how to become a data driven company.  That post spoke to what managers can do to embed BI in their organizations.  Today, a different perspective on what user departments can do to become more data driven.   Users…
infinite

NPrinting: A Cost-Effective Way to Publish Your Qlikview Analyses

Challenge Here at Dataspace we’ve worked with Qlikview for a number of years.  It’s a great tool but it has one, glaring limitation: Qlikview does not offer an inexpensive, native method to publish and distribute reports to users.  In…
infinite

How to Go from Delivering BI to Being Data Driven

If you participate in enough BI groups, you’ll see one question come up time and again: Why do so many BI efforts fail?  A lot of time it’s not failure in development, but in the implementation.  Let’s face it – too often these systems…
infinite

DataHero – Visualization for the Non-technical

I am consistently impressed by the number and quality of new data tools I’m seeing online. In the past, if you did not have access to an enterprise-level tool, your charting and analysis was done in Excel (and even if you did have an enterprise-level…
infinite

BI Predictions for 2015: 2 of 4

While different people will point to different times for the birth of business intelligence, I’ve always argued that BI started with the invention of writing.  Writing was invented, in part, as an accounting device.  How do I know how many…
infinite

BI Predictions for 2015: 3 of 4

DW Appliances Move from Strategic Options to Tactical Ones   As noted earlier, big companies are experimenting with Hadoop and will eventually lean on it heavily.  Proponents usually describe two key benefits of Hadoop: its ability to…
infinite

BI Predictions for 2015: 4 of 4

Cloud BI Opens the Field to Smaller & Smaller Organizations   Over the past year or two the number of software as a service business intelligence tools has exploded.  Some, like Microstrategy Cloud and Tableau Online, are offered…
infinite

BI Predictions for 2015: 1 of 4

While different people will point to different times for the birth of business intelligence, I’ve always argued that BI started with the invention of writing.  Writing was invented, in part, as an accounting device.  How do I know how many…
sandbox

The Business Intelligence Sandbox

A few weeks ago I wrote about the idea that there are really only two uses for BI: analysis and business process improvement.  Today, let’s focus a bit more on analytic users / analysts.  In particular, let’s consider the concept of the…
infinite

You Can Only do Two Things With BI

I get a kick out of how vendors tout Business Intelligence. It’s usually something like, “Get more out of your data” OR “Don’t you want to be data driven?” OR “All the really cool companies are doing it, shouldn’t you?”  The…
infinite

Signs you need BI – #1: People are overusing MS Excel and Access

Are you wondering if your organization is at the point where it needs to improve its reporting capabilities and invest in a business intelligence system?  This note is the first in an occasional series of emails and blog posts that describe…
infinite

BI: On the State of Relations Between IT and End Users

Over the years, we have learned a whole lot about what it means to be a business intelligence end user, and we’d like to share some of what we’ve learned. While much of what we read about data warehousing and business intelligence is focused…
infinite

Confusing Terminology Part 1 – Business Intelligence

I remember watching a really good webinar one time entitled Ten Changes to Maximize the Impact of Your BI Strategy by Gartner analyst Kurt Schlegel.  I found it to be well worth the hour spent on it (perhaps because it confirmed many of my…

Business Intelligence Radar: Pearl Harbor and the Battle of Britain

What is Business Intelligence?   It’s the set of tools, abilities, and data required to create a picture of the world and how your organization operates in that world. In building your business intelligence capabilities, you can learn…
infinite

Star Schema: Data Mart? Yes – Data Warehouse? No

The Two Roles of a Data Warehouse   Most people think of data warehouses as databases that solve reporting problems. However, it’s more useful to think of them as addressing two sets of problems: 1) Reporting, or data distribution, problems…
infinite

Infinite MIPS, Or How Your Hardware Vendor Let you Down

The Concept of Data Warehousing is Fundamentally Flawed   Ever step back, think about what you’re doing, and then ask yourself, “Why?” Ever ask the same question about the concept of data warehousing? Its time to face facts – while…