Sunday, January 12, 2014

A leap in artificial intelligence? Continued 3

In this saga about our (yours and mine)  

Artificial intelligence that learns from associated parallel streams of information

today's reflections are about the order in which we would like this AI to learn things.

Although we still don't know much about what this new AI should do (see previous post), the show must go on. (But feel free to still add your contributions to my previous post.)

Spending a little grey matter on what this AI should learn first might be handy.

Ever thought about why we use smileys?

Have a look at this short video showing the Dutch phrase "Ja, ja" twice (in English: "Yes, yes", as you might have guessed):

The first is a double affirmation / agreement; the second, on the other hand, means something like: "talk as much as you want, but I don't believe you".

Another striking example was posted by +Maya Davis in May 2013, about the pronunciation of Shakespeare's work.

In the field of computational linguistics we know very well that interpreting written text runs into highly complex structural relationships (as opposed to simple linear order), making unambiguous interpretation extremely difficult.
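As a toy illustration of that structural ambiguity, here is a minimal Python sketch: one linear word order, two different attachment structures. The sentence and the way the readings are encoded are invented purely for illustration.

```python
# The classic attachment ambiguity: the same linear word order
# admits two different structural readings.
sentence = "I saw the man with the telescope"

# Reading 1: the prepositional phrase attaches to the verb
# (I used the telescope to see the man).
reading_1 = ("saw", ["I", "the man"], {"instrument": "the telescope"})

# Reading 2: the prepositional phrase attaches to the noun
# (the man I saw was carrying a telescope).
reading_2 = ("saw", ["I", "the man with the telescope"], {})

for i, (verb, arguments, modifiers) in enumerate((reading_1, reading_2), start=1):
    print(f"Reading {i}: verb={verb!r}, arguments={arguments}, modifiers={modifiers}")
```

Nothing in the flat string distinguishes the two readings; a listener resolves them with intonation and context, which is exactly what plain text lacks.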

Obviously something important is lacking in written text:

The extra things we have in spoken language: pitch, stress, pauses, etc.
And body language, if we can see the person.

But we do understand written text, don't we? So what's up?

My interpretation is that when we read, we imagine what the melody, intonation, etc. are (or are supposed to be). Sometimes we also imagine the corresponding body language.
Smileys are one way to fill the gaps in the expressiveness of text compared to the spoken word.

In evolution?
As a baby, did you start with writing, or with seeing and listening?
Language could be as old as 100,000 years, whereas writing appeared around 3200 BCE.
Wow, those were two easy ones!

What do you think our AI should start with:

the texts available everywhere, or
rich speech recognition?

What do you think you see when looking at a picture?
Something in two dimensions, or do you make the shift to 3D?
If so, why can you do it?

Does M.C. Escher ring a bell?

Drawing by M.C. Escher

In images like the one above, I personally shift to a series of 3D images, the image changing with each part of the drawing I focus on. There is no way for me to merge them into a single coherent picture.

And what about classical (non-3D) video? Do you "see" depth? What do you use to interpret a 2D video as a 3D, lifelike experience?

As for language: what do you think the AI should start with:

3D vision (binocular or perhaps even N-ocular)?

Finally, for the vision component: what would you like the AI to see here:

post from +Donavon Urfalian 

An owl,
a bunch of fruit and vegetables,
or both? And in which order?

The last thing for today is a little clarification.

Although I frequently try to make you think about how we human beings do things, it doesn't necessarily imply that our future AI has to be biologically inspired.

The purpose is to build a holistic view of the richness of our perception, the implications of combined sensory input, and the need to rely on previously learned things for current interpretations (and actions).

Don't forget to answer the three questions above. At least for yourself. And if you feel the need to express it in a comment, you're welcome.


Wednesday, January 8, 2014

Personal Archive DIgitization Project

When you're reading this, the final #PADIP count-down has reached zero.

When this count-down started at 25, there was a 25 cm pile of documents left to be scanned.

The last 25 cm pile of documents to be scanned.

So what is this Personal Archive DIgitization Project?

Several years ago I decided that it was time to get rid of the numerous piles of papers (and boxes) I had kept since the age of 14: archives spanning 43 years.

Before that time there is not much, just a few things from my parents' archives that made it until now, some of which go back to my birth year. Like this one:

The card my mother made with the evolution of my weight since my birth.

I'm talking about the scanning of 14,000+ documents (estimated at 23,000 pages) and some 6,000 photos.
My first photos are from March 10, 1966: several shots from the live television transmission of the wedding of Princess Beatrix (our former queen).

Most of the photos were scanned from the negatives by an external company, though (and there are still some 400 photos and 300 slides left to do).

These figures are very low compared to Google's millions of book scans, but as a personal project they are considerable. At the least, it took me several years, working on it in my already very busy spare time.

And, nostalgic as I am, I kept two folders of paper documents of particular emotional value.

Phase 1 of PADIP : get rid of (most of) the paper (and make it available for recycling). => Done

OK, now I have a hard disk with these files (and backups, of course), and an operating system that can easily find what I'm looking for in the documents from the typewriter and printer ages, thanks to reasonably capable OCR.
For the handwritten material, the scans will have to be rerun through the next generation of OCR software. For now, the only clues are the meaningful file names.
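To make concrete what "the only clues are in the file names" means in practice, here is a minimal Python sketch of a filename search over the scanned archive. The directory layout, file names, and the helper's name are all hypothetical.

```python
from pathlib import Path

def find_by_filename(archive_root: str, *keywords: str) -> list[Path]:
    """Return scanned files whose descriptive names contain all keywords.

    For handwritten scans that OCR cannot read yet, the meaningful
    file name is the only searchable clue.
    """
    wanted = [k.lower() for k in keywords]
    return sorted(
        p for p in Path(archive_root).rglob("*")
        if p.is_file() and all(k in p.name.lower() for k in wanted)
    )
```

With files named like `1975-03-school-report.pdf`, a call such as `find_by_filename("/scans", "1975", "school")` would surface the right document even though its contents are not yet machine-readable.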

Parallel to these scanned documents there are of course the digital-age sources, and some digital record management headaches ahead. These digital archives start in 1986 and are continuously growing. Numbers? No precise idea yet, but a rough estimate gives me 20,000 emails, 10,000 documents and 16,000 digital photos.

There is also some structured data available. Just two small examples: since 2008 (my first iPhone), my position has been recorded every 5 seconds when I'm on the road. That amounts to 1800+ trails. And a database of my 1500 books with the date I bought them, price, pages, dimensions, etc. (Yet another scanning project?)
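A quick back-of-envelope sketch shows what 5-second sampling adds up to; the 30-minute trip duration below is an assumption for illustration, not a figure from the archive.

```python
# Back-of-envelope: how many GPS points does 5-second sampling produce?
SAMPLE_INTERVAL_S = 5

def points_for_trip(duration_minutes: float) -> int:
    """Number of position samples recorded during one trip."""
    return int(duration_minutes * 60 // SAMPLE_INTERVAL_S)

# A hypothetical 30-minute trip yields 360 points; at that size,
# 1800 trails would already total around 648,000 recorded positions.
per_trip = points_for_trip(30)
print(per_trip, per_trip * 1800)  # 360 648000
```

Even this modest sampling rate quickly produces a data set in the hundreds of thousands of points, which is exactly why the "personal Big Data" framing below is not an exaggeration.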

Perhaps I should ask Google to let me download my browser history, and the telephone companies my cellular positions (from before 2008). It would be nice to have those too. The NSA has them, so why not me?

So now I have my own personal Big Data. But it is mostly unstructured, so not very useful yet. The next phase of this project will solve that.

In this post I was talking about information extraction, and in another series of posts about artificial intelligence. And for the past 20 years I've been working on semantic network technologies (comparable to Google's Knowledge Graph).

Got the global picture?


The trick will be to build a coherent picture of all the available information: correcting OCR errors, determining the location and time of photos, distinguishing meetings I attended from the ones I was invited to but didn't attend, etc.
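As one small piece of that puzzle, here is a toy sketch of dictionary-based OCR correction using only the Python standard library. The word list and the sample tokens are invented; real correction would also exploit context, whereas this only fixes isolated near-misses.

```python
import difflib

# Hypothetical word list, e.g. built from the already-clean (typewritten,
# well-OCRed) parts of the archive.
KNOWN_WORDS = ["meeting", "invitation", "agenda", "attended"]

def correct_token(token: str, cutoff: float = 0.75) -> str:
    """Replace an OCR token with its closest known word, if close enough.

    Tokens with no sufficiently similar known word are returned unchanged.
    """
    matches = difflib.get_close_matches(token.lower(), KNOWN_WORDS, n=1, cutoff=cutoff)
    return matches[0] if matches else token

print(correct_token("rneeting"))  # 'rn' is a classic OCR misread of 'm' -> 'meeting'
print(correct_token("xyzzy"))     # no close match: returned unchanged
```

The `cutoff` threshold is the usual trade-off: too low and valid rare words get "corrected" away, too high and genuine OCR errors slip through unfixed.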

The challenge is not in getting the data, but in creating the software to get the right data, or better, to get the data right. Doing this manually is not an option.

This personal Big Data set will provide "food for thought" for the new AI (along with a lot of other material, of course).
And since three languages (Dutch, French and English), plus snippets of Spanish, Italian and Polish, are scattered around this set, the AI will also start the learning process it needs to offer something in return: a language-teaching capability for humans, as illustrated in the short video (a sort of prototype app).

Challenging isn't it?

#PADIP #ArtificialIntelligence #InformationExtraction #PersonalData

Edit 2014-01-13 Changed digitalization to digitization