Soon, you might not need anything more specialized than a readily accessible touchscreen device and any existing data sets you have access to in order to build powerful prediction tools. A new experiment from MIT and Brown University researchers have added a capability to their ‘Northstar’ interactive data system that can “instantly generate machine-learning models” to use with their exiting data sets in order to generate useful predictions.
One example the researchers provide is that doctors could make use of the system to make predictions about the likelihood their patients have of contracting specific diseases based on their medial history. Or, they suggest, a business owner could use their historical sales data to develop more accurate forecasts, quickly and without a ton of manual analytics work.
Researchers are calling this feature the Northstar system’s “virtual data scientist,” (or VDS) and it sounds like it could actually replace the human equivalent, especially in settings where one would never actually be readily available or resourced anyway. Your average doctor’s office doesn’t have a dedicated data scientist headcount, for instance, and nor do most small- to medium-sized businesses for that matter. Independently owned and operated coffee shops and retailers definitely wouldn’t otherwise have access to this kind of insight.
This new tool is built on automated machine-learning techniques that are becoming much more ‘au courant,’ since it helps expand the number of people for whom AI technology is accessible.
Northstar itself is the product of more than four years of work, and presents a blank canvas that’s compatible across multiple platforms, and then users can upload their own data sets, which show up as boxes on the interface. They can then drag and drop those into the centre area of the canvas and then draw connecting lines to indicate to that they should be processed with an algorithm of their choosing in combination with one another.
So basically, they could theoretically grab a dataset detailing metabolic rates of patients, and another one detailing their age, and then derive from that how often a specific disease occurs across those two factors. Now, with the new virtual data scientist feature, they’ll be able to combine inputs to generate predictive, AI-based analysis across these combined factors as well.
Researchers have also designed this VDS system so that it’s actually the fastest application of automated machine learning to date. That’s another key piece for making it usable by everyone, since it’s not really feasible to imagine people working with this digital whitetable and then waiting ours for results to come out. Next up, it’s going to improve error reporting to help ensure that non-specialist users not only find it easy to use, but also get clear indicators when they do something wrong so they can fix it next time.