Back Get this article in PDF
Data Mining with PCs?
Well, not really, although the goal of a new
methodology named contextual data modeling
resembles that of data mining. Both processes
are analytic tools that search databases to
uncover information that is not obvious on a
single flat table of data. Data mining involves
massive databases probed by large computers,
using mathematical formulas, to produce
analysis — even forecasts — based on statistical
theory.
Contextual data modeling involves small
databases, on personal computers, with the
mind of the user defining and controlling the
analysis. But the end result is the same:
decisions made by humans. Note, however, that
with contextual data modeling (CDM) the
processing and decision-making are not
separate. The user’s mind and the data
processing are interactive. CDM consists of a
series of formative decisions, reaching answers
to the initial question, and often answers to
questions that arise spontaneously as the user
studies that data from different perspectives.
How does this happen?
Making decisions based on new knowledge is
a philosophical matter. Immanuel Kant argued
that empirical observation alone does not
constitute knowledge. Kant said that knowledge
requires two components: sensory perception
plus a conceptual component of ordering and
then interpreting. Here is an example. Assume
you are driving, late in the day and tired, in a
sparsely settled area. You glance at the gas
gauge and perceive a low fuel level. Your mind
associates that fact with your knowledge that
your destination is 30 miles ahead, with your
desire to arrive there that evening, with your
feeling of fatigue, with your valuing alertness as
essential when driving, with your belief that the
next gas station may be far ahead, and with your
recollection that you just left a town with gas
stations and several motels. So, what do you
decide to do?
Whether you continue ahead, or go back for
fuel and then resume driving, or stay overnight
and refuel in the morning results from your
personal analysis of the situation. It started with
a single datum: low fuel level. That was Kant’s sensory perception. Your mind blended that
new knowledge with your prior knowledge,
goals, and values. Then your intuition and
imagination kicked in, you subconsciously
prioritized (Kant’s ordering) the many factors
involved and reached a final interpretation of
what they all — not just the single
datum— meant you should do.
You also look at an airport monitor seeking a
single datum: the correct gate number. That
same data table, however, contains a wealth of
information needed by other people—those
interested in all the data, not a single datum as
you were. Control tower staff want to see the
table with times in the first column. Persons
handling gate assignments want gate numbers
in the first column. A person curious about
traffic to a particular city wants destinations in
the first column. And in each of those cases,
different arrangements of the remaining
columns provide different information. How,
then, can a person extract from a database all
the useful information it contains? One answer
is data mining. Another answer is CDM. Both
terms use the same word, modeling, but the
meaning differs.
Data mining starts with a model of what is
intended, as with a business model for a new
venture, or a model of a boat one plans to
build. Algorithms are designed accordingly,
and the data are processed to derive the
answer generated according to the model.
Modeling with CDM, however, is similar to
modeling a lump of clay to change it into a
useful object. And as we do that we look at it
from many different perspectives. With CDM
we also add motion/movement as a key factor
in the analysis of a data table. How do we do
that?
A data table can be modeled three ways.
First, the columns can be rearranged. Second
the table can be shortened or lengthened by re-selecting the items to be listed. Third, the
values of the parameters of items on the table
can be revised. Thousands of different tables
can be designed, each giving the decision-
maker new knowledge. But that takes time. And
what a person sees is held for processing primarily in short-term memory. So potentially
valuable new knowledge can fade quickly while
we wait for a new printout. Further, the pattern
of our current reasoning may be lost unless the
desired changes are made immediately. (We
don’t understand a statue by looking at the
front, then a side hours or days later, etcetera).
Consequently, the analyst must be able to easily
and interactively introduce changes, immediately
see the results, and continue that process until
all possible useful information has been
disclosed. Data mining does not lend itself to
that. With CDM, the analyst can see — in a
controlled series of varied tables — the values of
selected parameters, and get their relationships
in the context of (1) all the displayed parameter
values, (2) other data in that person’s mind, and (3) his or her current line of reasoning. Thus the
name contextual data modeling.
Our goal was giving to analysts of small
datasets the ability to personally interact with
their data using only three primary screens:
Query (design a data table), View (see the query
result), and Edit (change parameter values, and
then see the table automatically resorted (excellent for "what if" questions). That
required one simple invention: combining on one screen the controls for the select and sort functions. A patent was granted.
Then, to better manage examining all
possible permutations of a data table (for
example, 6 columns can be arranged 720
ways) we invented automated sorting. It is not
logical to consider a permutation useless until
it is examined. Another patent was granted.
Consistent with the goal of shortening the
link between the user’s mind and the monitor,
our CDM programs contain, on the Edit screen
(same as the entry screen) item descriptions, of
any length, rather than simply the names of the
items. Also listed are links to other items in the
dataset, not just those on the table, that relate
to the one being viewed. And to that screen the
user can attach notes containing temporary
remarks. Then, with the Edit screen open, the
user flips back and forth through similar
screens for all items on the current table.
Sensory perception is enriched in the process.
Comparing CDM programs with other
software is like comparing oranges with
tomatoes, both being fruit. The only software
incorporating our patented technology, our
CDM programs are a new and different type of
software, fitting no existing category.
Back Get this article in PDF