What is the deep internet, or invisible internet? Is it some kind of Bermuda Triangle that only a privileged few can get in and out of, or is it a myth like Atlantis?
The truth is much simpler and plainer.
The deep internet is basically the part of the internet that is not indexed by search engines or directories. In other words, it consists of pages, or rather repositories of information, usually dynamic databases, whose content cannot be crawled by search engines and therefore cannot be included in their search results.
Unlike other web pages, these databases are not accessible to search engines, either because they require a username and password to enter or because they are dynamic pages. That is, they only release information and results when a set of variables is filled in, and the data table is created at that moment and not before.
As a result, they are not accessible to an ordinary search engine.
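To make the idea of a dynamic page more concrete, here is a minimal sketch in Python. Everything in it is hypothetical: the page names, the toy database and the `crawl` and `search` functions are assumptions made for illustration, not how any real search engine or site works. It only shows that a crawler which follows links never sees results that exist solely as the answer to a submitted query.

```python
from collections import deque

# Hypothetical in-memory "site": static pages link to each other,
# while the database can only be reached by submitting a query.
STATIC_PAGES = {
    "/": ["/about", "/search"],
    "/about": ["/"],
    "/search": [],  # a search form: it contains no links to any result
}

DATABASE = {
    "patent": ["Patent record 1", "Patent record 2"],
    "tariff": ["Tariff record 1"],
}


def crawl(start):
    """Follow links breadth-first, as a very simple crawler would."""
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for link in STATIC_PAGES.get(page, []):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return seen


def search(term):
    """Build the result table only when a query is submitted."""
    return DATABASE.get(term, [])


print(sorted(crawl("/")))  # pages the crawler can reach by following links
print(search("patent"))    # records that only appear after a query
```

The crawler reaches the search form but none of the records behind it, which is roughly the situation described above.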
To keep it simple, hold on to this idea:
The deep Internet is made up of all the information and databases to which search engines and directories do not have direct access.
In reality, it is more than likely that you are already using it or have used it without realizing it.
Either way, if you want to know what it is and how to use it, read the post to the end, where you will find a gift.
How big is the Deep Internet?
No one knows the exact size of the deep internet or the invisible web. According to the Wikipedia article on the Deep Web:
In 2000 it was estimated [3] that the size of the invisible internet was 7,500 terabytes of data in approximately 550 billion documents. [4] For comparison, it is estimated that at that time the surface internet occupied 167 terabytes, and that the content of the United States Library of Congress amounted to about 3,000 terabytes that could not be accessed by search engines.
Estimates based on extrapolation from a study by the University of California at Berkeley suggest that today the deep internet should be around 91,000 terabytes. [5]
Size doesn't really matter. The main idea to be clear about is that approximately 95% of all the information that exists on the Internet is not indexed by any search engine.
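As a rough check, the two figures quoted for the year 2000 can be compared directly. The short Python sketch below uses only those two numbers; the variable names are mine, and the result is an order-of-magnitude illustration, not a new estimate.

```python
# Figures quoted above for the year 2000, in terabytes.
deep_tb = 7500
surface_tb = 167

# How many times larger the deep internet was than the surface internet.
ratio = deep_tb / surface_tb

# Share of the combined total that belonged to the deep internet.
deep_share = deep_tb / (deep_tb + surface_tb)

print(f"Deep internet is about {ratio:.0f} times the surface internet")
print(f"Deep internet share of the total: {deep_share:.1%}")
```

With these inputs the deep internet comes out at roughly 45 times the surface internet, or close to 98% of the combined total, which is in the same range as the 95% figure.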
Wow, that leaves a lot of information that we can take advantage of! Don't you think?
The question is how. Well, that's where competitive intelligence systems come in, which specialize in compiling this kind of information from the deep internet, but that's a different story.
Why is the Deep Internet Important?
Well, if you think about it, the idea of sticking only to Google searches is very attractive. What am I saying, it's more than attractive!
It would be perfect if I could ask Google anything and have it spit out the results I need.
However, Google and other search engines are governed by algorithms, and these cannot be perfect. Add to that the existence of databases and dynamic pages, and we run into a limitation: a single tool cannot index all information. It is basically impossible.
So we can be sure that there is much more information than we were initially aware of.
Given this, to locate certain high-quality information we need to access the invisible web and take advantage of it.
The good news in all this is that not everyone has access to the invisible internet, and if we know how to do it, we will be able to enjoy better data and information than our competition. It's that simple.
Where to start? Some deep internet resources
Luckily for everyone, other people have asked the same question as us and have left us plenty of entry points to the deep web that we can take advantage of:
Scientific resources of the deep Internet or the invisible Internet
- Web of Knowledge: one of the largest citation databases in the world, with more than 54 million records
- Elsevier: a repository with more than 2,000 medical and health journals
- Science Direct: more than 2,500 scientific journals and more than 11,000 books
- PubMed: the search engine for MEDLINE. It contains more than 22 million records on biomedical research
- Ingenta: contains journals, with more than 12,000 publications
- USPTO: a search engine for patents and trademarks of the United States of America
- Espacenet: a patent search engine for European countries
- Latipat: an Espacenet platform that adds patent results from Latin American countries, Spain and Portugal
Statistical resources of the deep Internet or the invisible Internet
- Eurostat: statistical source for all European countries
- USA.gov: US statistical source
Financial Data Resources of the Deep Internet or Invisible Internet
Deep Internet or Invisible Internet International Trade Resources
- Comtrade: United Nations database on import and export data and HS codes
- Cameradata: Spanish database on import and export data
- Market access database: data on tariff rates in the different export destination countries
- The World Trade Organization: collects legal information on international trade
Resources on Deep Internet or Invisible Internet Legislation
As you can see, many of these resources are well known and not hidden anywhere.
What happens is that search engines do not collect their content.
Anyway, these are just a few examples, and they don't even amount to a small fraction of what exists. In fact, new tools and deep web access directories keep appearing, and they take time to reach the general public.
Other invisible web resources
Here we should mention several tools, such as:
- Complete planet: a tool considered one of the main gateways to the deep internet for many years. It is a directory with more than 70,000 databases and resources
- Infomine: a University of California resource containing over 100,000 links to other databases
- Scirus: a meta-search engine for scientific research, specialized in research institutes and universities
It is worth highlighting that a large part of the valid resources of the deep web come from libraries and university education centers and that, consequently, the information is of high quality and of great value to the scientific and research community.
Even so, as we have seen, we can also find valuable resources for business. In addition to these three resources, you can consult this compilation of gateways to the deep internet by Ernesto Marrero.
I also suggest you use EYE to launch a simultaneous search on several of these services. It is very useful.
How to Take Advantage of the Deep Internet
The truth is that it is not easy, and it depends on finding resources that are valuable for you or your business.
What I can tell you is that, once you have identified them, it is very important to pay attention to how often you use these information sources.
If, for example, we have found one of them and we use it repeatedly with identical or very similar searches, it is better to have a system that retrieves this information automatically by repeating those searches.
This is what competitive intelligence systems do. They act as a dedicated search engine for one or more of these deep internet directories, collecting the information they return. It would be like programming a search engine to repeat the hundreds of searches that have been entered, 24 hours a day.
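As a rough illustration of repeating saved searches automatically, here is a minimal Python sketch. It is not how any particular competitive intelligence product works. The `run_saved_searches` function, the `fake_search` stand-in and its data are all hypothetical, and a real system would have to respect each source's terms of use and access rules.

```python
def run_saved_searches(search, queries, seen):
    """Run each saved query and return only the results not seen before.

    `search` is any callable that takes a query string and returns a list
    of result strings. `seen` is a set of (query, result) pairs and is
    updated in place so the next run knows what is already known.
    """
    new_results = {}
    for query in queries:
        fresh = [r for r in search(query) if (query, r) not in seen]
        seen.update((query, r) for r in fresh)
        if fresh:
            new_results[query] = fresh
    return new_results


# Stand-in for a real deep internet source (hypothetical data).
_fake_db = {"solar panel patents": ["Record A"]}


def fake_search(query):
    return list(_fake_db.get(query, []))


seen = set()
saved_queries = ["solar panel patents"]

first = run_saved_searches(fake_search, saved_queries, seen)
_fake_db["solar panel patents"].append("Record B")  # the source gets updated
second = run_saved_searches(fake_search, saved_queries, seen)

print(first)   # everything is new on the first run
print(second)  # only the record added since the first run
```

In practice a function like this would be called on a schedule, for example once a day, so that only new records need to be reviewed.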
Do you intend to take advantage of the deep internet?
It's probably not just about the deep web or the internet.
There is much more
The links that I have presented in this post are just the tip of the iceberg of the invisible web.
If you want to keep learning how it works, here is our surprise gift:
I also leave you a link to this deep web white paper.
It is a bit outdated, but it will help you better understand this part of the web.
Finally, note that today's web has several layers, and part of this deep web or deep internet is not even reachable with conventional browsers.
For that we have to use Tor, but we'll leave that for another time.
What do you think of this post? Do you know of any deep web resources you want to share?