Unstructured Information
It's estimated that more than 80% of all the world's information is not accessible over the Internet because it exists in the form of unstructured information.
Unstructured information is neither indexed nor properly formatted for Internet integration.
However, all of that is changing.
Unstructured information is neither indexed nor properly formatted for Internet integration.
However, all of that is changing.
Extreme Makeover: Internet Edition
What this means for the future is that information uploaded to the Internet may no longer be subject to the limitations inherent in traditional websites that are typically created to promote a common theme, subject or specific area of interest.
At present, to share content, documents and files are hand picked, one by one, for uploading to specific Internet sites that reflect the particular interests, objectives or business model of the site owner. Developing a website around a particular theme or concept necessarily restricts the nature of the content that will ultimately be uploaded to the site. As a result, the total content accessible online today represents only a fraction of the total content available for online archiving.
From file cabinets to storage lockers to boxes piled high in businesses, libraries, schools, warehouses, apartments, houses and garages around the world, a potential universe of untapped content exists in the form of hard copy files, documents, film, audio tape, video tape, photographs, etc. However, without a process for capturing and organizing this information, the great majority of this vast knowledge base will never see the digital light of day.
The ability to quickly convert and upload large quantities of data directly to the Internet without the requirement of first finding, joining or creating a content themed website is vital to such an undertaking.
New & Improved Web 3.0
Data Mining for the Masses
User and third party generated data mining templates will be developed to organize and display search results in a style and format most beneficial to the user.
Whether a given template takes the form of a simple photo light box, mp3 music player or even a compilation page that provides a framework for social networking, each search query will provide highly relevant, meaningful data capable of further analysis and manipulation.
A "Presidential Library" For The Rest of Us
Implementation Notes
TARGET CONTENT:
Databases
Documents
Photos
Film
Audio
Video
Search engine/user-friendly interface includes fields for the following tags and selections:
1. Title
2. Copyright notice. Open or restricted content? Content provider could even choose not to release some or all the content until after some future date.
4. Keywords
5. Brief description
6. Whether content provider is in physical possession of original and whether original is available for sale, exhibit, inspection and/or research.
7. Option for content contributors to bulk upload content without completing any of the above, or in the alternative, content contributors could complete only one submission form for an entire bulk upload. Over time, users who access the material can fill in the rest.
AT THE CORE
This process must not rely on content contributors being web savvy or even possessing a computer. In its simplest form, the content contributor simply feeds content into an automated device that is programmed to scan, read, download or otherwise extract content as it is converted and uploaded to the Internet. Such a device could range from an all in one content up-loader intended for home use to much more sophisticated commercial and enterprise applications accessible through service bureaus, kiosks, libraries and schools.
BUSINESS SPIN-OFF: SCANNER SHREDDERS
More data. Less boxes.
Databases
Documents
Photos
Film
Audio
Video
Search engine/user-friendly interface includes fields for the following tags and selections:
1. Title
2. Copyright notice. Open or restricted content? Content provider could even choose not to release some or all the content until after some future date.
4. Keywords
5. Brief description
6. Whether content provider is in physical possession of original and whether original is available for sale, exhibit, inspection and/or research.
7. Option for content contributors to bulk upload content without completing any of the above, or in the alternative, content contributors could complete only one submission form for an entire bulk upload. Over time, users who access the material can fill in the rest.
AT THE CORE
This process must not rely on content contributors being web savvy or even possessing a computer. In its simplest form, the content contributor simply feeds content into an automated device that is programmed to scan, read, download or otherwise extract content as it is converted and uploaded to the Internet. Such a device could range from an all in one content up-loader intended for home use to much more sophisticated commercial and enterprise applications accessible through service bureaus, kiosks, libraries and schools.
BUSINESS SPIN-OFF: SCANNER SHREDDERS
More data. Less boxes.
Googlecentric Universe
Can Google get out of its own way?
Google can move the process along by allowing each one of us to play traffic cop in our respective information intersections while it concentrates on building the digital bridges that will both connect and encourage the unrestricted flow of unstructured information onto the Internet.
Although wildly successful, at some point, it must be acknowledged that Google, YouTube, MySpace and their progeny operate on linear paths in a decidedly non-linear world. While these online applications may share links and, at times, even overlap in form and function, it doesn't change the limitations inherent to their respective underlying structures.
Although wildly successful, at some point, it must be acknowledged that Google, YouTube, MySpace and their progeny operate on linear paths in a decidedly non-linear world. While these online applications may share links and, at times, even overlap in form and function, it doesn't change the limitations inherent to their respective underlying structures.
Semantic Web
Web of Meaning
WHAT COMES NEXT?
Meaning Web + Intelligent Web = The Ubiquitous Web
Stay tuned...
Meaning Web + Intelligent Web = The Ubiquitous Web
Stay tuned...
- Google Hearing Footsteps?
- Google may eventually be displaced as the pre-eminent brand on the internet by
a company that harnesses the power of next-generation web technology, the
inventor of the World Wide Web has said.
UIMA
Unstructured Information Management Architecture
- IBM: The Knowledge Rush
- IBM & DARPA have joined forces in developing next generation technology for the extracting and indexing of unstructured information. It is particularly suited to enterprise applications.
Open Letter To Google
September 19, 2009
Dear Google:
In the process of getting my house ready for sale, I shredded approx. 100 storage boxes of personal data (i.e., photos, letters, papers, reports, videotapes, cassette tapes, etc.) that covered a span of 53 years. From birth to law school to the present. With few exceptions, the great majority of my life's data footprint is gone.
So, while your company is on a mission to archive millions of books and various other publications that aren't going anywhere anytime soon, billions upon billions of irreplaceable pieces of information are vanishing from the face of the earth forever. It's NOT about me and my data. It's about the aggregate effect of losing so much information from so many people - day after day after day with no end in sight. Not taking an aggressive position to preserve the historical record of humanity on a personal, individual basis is a loss, the true dimensions of which may be recognized only in hindsight. In the absence of such a vast database, consider the research that will never be conducted - the questions left unanswered. Dots that if connected, would most certainly have revealed as yet undiscovered patterns leading to new information and insight.
I know that you provide for the upload of individual pieces of unrelated information, but in it's current form, it will not make a meaningful difference on the global scale I have described. Such an undertaking requires so much more.
Best regards,
Trendgineer
Dear Google:
In the process of getting my house ready for sale, I shredded approx. 100 storage boxes of personal data (i.e., photos, letters, papers, reports, videotapes, cassette tapes, etc.) that covered a span of 53 years. From birth to law school to the present. With few exceptions, the great majority of my life's data footprint is gone.
So, while your company is on a mission to archive millions of books and various other publications that aren't going anywhere anytime soon, billions upon billions of irreplaceable pieces of information are vanishing from the face of the earth forever. It's NOT about me and my data. It's about the aggregate effect of losing so much information from so many people - day after day after day with no end in sight. Not taking an aggressive position to preserve the historical record of humanity on a personal, individual basis is a loss, the true dimensions of which may be recognized only in hindsight. In the absence of such a vast database, consider the research that will never be conducted - the questions left unanswered. Dots that if connected, would most certainly have revealed as yet undiscovered patterns leading to new information and insight.
I know that you provide for the upload of individual pieces of unrelated information, but in it's current form, it will not make a meaningful difference on the global scale I have described. Such an undertaking requires so much more.
Best regards,
Trendgineer
Unstructured Information: Latest News
- A word about Case Management
- I've highlighted the key term in the quote, ECM is about managing unstructured information, such as documents, audio files, pictures and video files. They are not intended to be used as a Database, although each one can be used in such ...
- Your next employee (Part Two)
- By the 1990s, technology had developed into delivering unstructured information to any consumer through the worldwide web. And today, technology focuses upon the production of unstructured information for any consumer using social ...
- Unstructured Analytics - A Major New BI Market Emerges « Mike ...
- With more and more unstructured information not only on the public internet but also in the enterprise the need to manage this information and extract knowledge from it is increasingly in demand in commercial enterprises. ...
- Taxonomy Classification: Speeding Up Business
- There is rarely a situation in a modern business when a company does not have a need to classify, manage and make use of previously unstructured information. The need to categorise data is only the first priority; ultimately the ...









