Are we reaching our information event horizon?

Are we coming to an information event horizon at which point the speed we collect new information will move faster than our ability to delete it; let alone manage it.

Blackhole

Up to this point the computing power and cost of storage have been a self limiting factor in our ability to hold on to this much data.  Previously only large institutions and kings had vast libraries of information. Now this limitation has gone. When taking pictures at your next family event, you will notice that the memory card in the camera allows you to take hundreds of pictures. Even though many pictures are not worth keeping you will never delete all the bad photographs. Instead you copy the full contents of the memory card to your home computer.

This is not to say that you will use all of these photo’s. It is very unlikely that you will upload all 300 pictures of Aunty Mildred’s birthday to your Facebook account. However, at the same time you will not remove most of the unwanted photographs as there might be something in the future that you decide you will want. This shows that the cost of storage is no longer a limiting factor in technology.

This is only the start of the level of information we will collect over our life time as the cost of storage further reduces.

The Internet and cloud computing is one of the major factors for moving this event horizon closer. The formation of huge, seeming endless storage, hosted over multiple locations means we are not limited by any physical constraints anymore. In fact we are not bound by any single location.  The comical idea of a bookshelf with a collection of books is further removed through latest eReaders that have arrived. These devices in combination with the Internet remove the need for us to physically go to a store to buy books and allow us to get new books without moving form our house. This would have been seen as magic only a few decades ago.

This is not to say that any of these inventions are bad or wasteful, far from it. Technology has enabled many people to study and express their views of an event in ways that have never been possible before. With no censorship people can capture their own version of history. In fact the filming of events and social uprising via video has been one of the strongest benefactor of this new low cost storage.

The real issue is our ability to recall information and manage it. As humans we are have not evolved at the same rate as our technology. Our brain was built with a specific ability to recall facts and automatically filter and remove memory that have no obvious purpose to us anymore. For example not many of us will remember where we parked the car four weeks ago at the supermarket, even though we visit it every week. This can be very useful as we can focus on the information that is most important to us, and remove all the clutter of everyday life.

However electronic storage provides no way of automatically filtering the information that we find useful from the information that we are not bothered about. This does not mean we want a automated delete system, just a better way to find things that are important to us. We need technology to work in a way that more closely models the way our brain works.

So what is an event horizon? The term is used to describe the distance from the centre of a black hole when crossed you can never escape. When inside this horizon the gravitational pull will never let you go. Information technology is starting to behave in a way that mimics this same point of no return.

With more complex smart phones and cheaper recording devices like digital cameras we will soon be able to capture everything that is happening around us and store it at a speed we can never expect to be able to use.

So what is the answer? There are many possible outcomes, smarter ways in which we pick what we want to keep, and smart automation that helps highlight information. However the level of information kept will still be greater than our own brain is able to remember. And searching all of it every time we need something will become slower and slower the more we have.

Therefore we need to change the way in which we retrieve information, we need to find methods that allow us to forget all the details but focus on only a subset of topics that will enable us to obtain the detail that we want. In this way we will not need to hold the full index of the information in our heads just the subset of information that enables us to navigate it.

Search engines have tried to do this, however they are unable to allow the person to truly control the way in which the index is created. Search engines today only allow you to manipulate a set of standard rules that try to provide information in a common way. We all know that the way in which you remember an event is not the same way that another person will remember the event.  Therefore  it is incredibly complex to provide a single tool and method of information retrieval that will work for everybody. It is also why these search engines can only provide meaningful results when the output has be further refined.  The act of entering a search term and then select the third item in the list is a method of refinement. It changes the importance of that entry to the terms over time. This method is used by many search engines to improve the way to find things in the Internet.

This works well when many millions of people look for similar information over a common source (the Internet). However it has been shown that the similar methods do not result in the same level of productivity when used in isolation. For example how many of us have tried desktop search tools to find a file, only to get a long list of meaningless results.  Unless we can narrow the search by dates or file type we do not get a meaningful result.

Therefore a better more natural method of building a reference is required one which is directly tailored to an individual and allows the person to use the way that they think to arrange and mange information.

New taxonomies and linguistic technologies are some of the possible ways forward if they can be made to adapt at the same speed at which we can store the information. Many methods today still require post analysis of the information and manual configuration that make the speed of retrieval not real time. Personal tagging that automatically adapts to your experience and knowledge is a better way in which we will be able to speed up this process. Combined these ideas with better retrieval based on personal key terms will be the next major step forward.

Lets hope the technology  we use to collect information will also find ways to stop us from falling completely into the information black hole that we are creating.

Share and Enjoy:
  • Digg
  • del.icio.us
  • Facebook
  • Google Bookmarks
  • Blogplay
  • LinkedIn
  • Reddit
  • RSS
  • Twitter

Achieving Enterprise Collaboration Requires Vannevar Bush’s Memex

As the number of different types of information continue to proliferate, the need to be able to get a single view of all one’s digital resources is getting harder.  Harder still is to be able to collaborate effectively in the workplace where there are a multitude of structured and unstructured data sources and systems.  But the need for a solution to this problem has long been recognized.

Vannevar Bush laid down his proposal for the Memex over 65 years ago in the shadow of the Second World War when the perceived value of technology was at its zenith but before computers were known outside the scientific community.  Bush’s memex would enable all information to be ‘consulted with exceeding speed and flexibility’ and ‘given a memex, a scholar [user] could create her own knowledge tools as connections within reams of information, share these tools, and use complexes of tools to create yet more sophisticated knowledge.’  How fresh and pertinent these goals still seem!

It is now almost thirty years since the first home computer made its appearance.  Since then there has been a breathtaking rate of change of software and hardware technologies which has revolutionized the home and the office alike.

And the rate of change is not slowing down.  The good news is that there are masses of opportunities to stake your claim for a new idea in this computing Wild West.  The bad news is that we still haven’t hit on the right or even a good way to do certain things.

Take the core task of managing all an individual’s or a business’ information i.e. the sum total of all its different types of electronic resources.  There is no single way of getting a single view of all the different resource types (documents, spreadsheets, presentations, images, emails, attachments, web pages) that we have which is categorized according to the subjects we use.

The file management capability has changed not a great deal in the last three decades.  Sure, it has additional features such as virtual folders, some federation and better search.  But in all the main operating systems, it is still based on a hierarchical approach which doesn’t work well for large amounts of information held in many different places.

Search, of course, has been the upcoming kid on the block for some time.  It works fantastically well most of the time on the web.  But use it on the desktop where many files contain words which are frequently used and one gets a huge amount of results.  This still is useful when you have no idea what document you are looking for or where it is but is somewhat excessive when looking for the file you have just been working on.  And you can’t use search to share data.

The latest solution to the problem, especially in the workplace, is to employ some kind of collaboration tool.  These are great for a specific project in isolation to bring a group of people together.  But when you have to use a number of different workplaces for different jobs, they end up being just another place one has to go to find and upload stuff.  This all adds to an increasing cognitive strain on the belaboured user.

What is really required in the enterprise is to achieve a complete, convergent and instantaneous visibility of all a user’s digital resources no matter where they are and ordered by the topics and groupings that are part of the domain that are being working on.  The user should then be able to share resources associated with particular resources to other users and teams as they wish.  These resources can then be used to create new resources which can themselves contribute to the knowledge base or as part of the process to complete certain tasks.

Which is pretty much identical to the requirement Vannevar Bush laid down.  Incredible that all our advances haven’t yet brought us the complete solution.

Share and Enjoy:
  • Digg
  • del.icio.us
  • Facebook
  • Google Bookmarks
  • Blogplay
  • LinkedIn
  • Reddit
  • RSS
  • Twitter

Hello world!

Welcome to the TopicLog blog, this blog looks at all the technology and tools used today to help people and companies improve data management and searching.

Share . . . → Read More: Hello world!

Share and Enjoy:
  • Digg
  • del.icio.us
  • Facebook
  • Google Bookmarks
  • Blogplay
  • LinkedIn
  • Reddit
  • RSS
  • Twitter