Scouring billions of links for six years showed us the web is both growing and shrinking

The online world is continuously expanding, always aggregating more services, more users and more activity. Last year, the number of websites registered on the “.com” domain surpassed 150,000,000.

However, more than a quarter of a century since its first commercial use, the growth of the online world is now slowing down in some key categories.

We conducted a multi-year research project analysing global trends in online diversity and dominance. Our research, published today in Public Library of Science, is the first to reveal some long-term trends in how businesses compete in the age of the web.

We saw a dramatic consolidation of attention towards a shrinking (but increasingly dominant) group of online organisations. So, while there is still growth in the functions, features and applications offered on the web, the number of entities providing these functions is shrinking.

Web diversity nosedives

We analysed more than six billion user comments from the social media website Reddit dating back to 2006, as well as 11.8 billion Twitter posts from as far back as 2011. In total, our research used a massive 5.6TB trove of data from more than a decade of global activity.

This dataset was more than four times the size of the original data from the Hubble Space Telescope, which helped Brian Schmidt and colleagues do their Nobel Prize-winning work in 1998 to show that the universe’s expansion is accelerating.

With the Reddit posts, we analysed all the links to other sites and online services (a couple of billion in total) to understand the dynamics of link growth, dominance and diversity through the decade.

We used a measure of link “uniqueness”. On this scale, 1 represents maximum diversity (all links have their own domain) and 0 is minimal diversity (all links are on one domain).

A decade ago, there was a much greater variety of domains within links posted by users of Reddit, with more than 20 different domains for every 100 random links users posted. Now there are only about five different domains for every 100 links posted.
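
As a rough illustration of how a score like this could be computed, the sketch below assumes uniqueness is simply the number of distinct domains divided by the number of links in a sample. That formula is an assumption made for illustration and may differ from the one used in the study. The function names and example URLs are hypothetical. Under this assumption, 20 different domains per 100 links corresponds to a score of about 0.20, and five per 100 to about 0.05.

```python
from urllib.parse import urlsplit


def link_domain(url):
    """Return the lower-cased hostname of a URL, or None if it has none."""
    host = urlsplit(url).hostname
    if host is None:
        return None
    # Simplification (assumed): treat "www.example.com" and "example.com" as one domain.
    return host[4:] if host.startswith("www.") else host


def link_uniqueness(urls):
    """Distinct domains divided by the number of links that have a domain."""
    domains = [d for d in map(link_domain, urls) if d is not None]
    if not domains:
        raise ValueError("no links with a recognisable domain")
    return len(set(domains)) / len(domains)


# Hypothetical samples: every link on its own domain vs. links piling onto one domain.
diverse = [
    "https://a.example/1",
    "https://b.example/2",
    "https://c.example/3",
    "https://d.example/4",
]
concentrated = [
    "https://www.a.example/1",
    "https://a.example/2",
    "https://a.example/3",
    "https://b.example/4",
]

print(link_uniqueness(diverse))       # 1.0
print(link_uniqueness(concentrated))  # 0.5
```

With this simple ratio, a sample in which every link points to the same domain scores 1/n rather than exactly 0, so the score only approaches 0 as the sample grows. Using the hostname as the "domain" is also a simplification, since subdomains of the same site would count separately.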