The New Semrush Backlink Database: Bigger, Better, Faster
2024-12-25 13:20

Although the Backlink Analytics tool is one of the oldest features of Semrush, we have to admit that it may have been the weak link of our SEO toolkit. We knew we had to step up our game, so about a year and a half ago, we set out to change the status quo.

Semrush, while being a well-rounded toolkit for digital marketers, has always had a soft spot for SEO. Helping people drive organic traffic from search engines to their content has been one of our most important goals since our inception.

This goal led us to become a world-renowned SEO Suite, allowing us to win multiple industry awards over the years.

SEO is tricky, as it involves complex and intertwined moving parts. To get the top rankings, you have to nail down every single step on-site and off-site. Throughout this entire process, we strive to provide our users with the best solutions.

To stay praiseworthy, we are continuously working on improving our toolkit. Today, we are proud to share with you our latest breakthrough:

We needed a major improvement in the quality of our backlink data. No workaround would do; it required a complete overhaul of our data-gathering process. To focus on our end goal, we put the development of all other backlink features on hold and made a huge list of things that would improve our backlink data delivery to clients.

The path was clear, and all we had to do was cross off the items on our list.

We won’t bore you with the technical details of our backlink database’s overhaul, but here is a quick rundown of what was done:

Crawler. After carefully examining the drawbacks and boundaries of the existing architecture, we decided to rewrite our crawler from scratch. And so we did: we designed an entirely new approach to data gathering.

Crawling queue. The first tests of our new crawler revealed that its request queue was not properly handling the amount of data it was now collecting. We tried solving this by simply increasing hardware capabilities, but it was not good enough, so we developed a more efficient crawling queue.

Seeding. To provide our crawler with a quality initial seeding, we queued up all the URLs from Google’s Top 100 for 450 million keywords from our Organic Research tool; this ensured that our database was relevant from the ground up.

Storage. Increased data collection obviously demands more storage space — we had to quadruple our server size.

To find out exactly where we are as a backlink provider, we decided to measure ourselves against the best: Majestic, Ahrefs, and Moz.

We will explain the methodology in a second. First, let’s assess our development progress during the past six months. Looking at the relationship between the four top SEO tools, you can see that we have made a giant step forward.

It was not easy to figure out a methodology that would be both clear-cut and fair.

You can always find domains that show your backlink tool in a good light. That is why we decided to use a random set of 100 domains (out of 100,000) for each month to show how the contestants performed during the past six months.
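As a rough illustration of the sampling step, here is a minimal Python sketch. The pool of placeholder domain names, the function name monthly_sample, and the use of a per-month seed are all assumptions made for the example; the article does not describe how the sample was actually drawn.

```python
import random

def monthly_sample(pool, k=100, seed=None):
    """Draw k distinct domains uniformly at random from the pool."""
    rng = random.Random(seed)  # a fixed seed makes a given month's draw repeatable
    return rng.sample(pool, k)

# Hypothetical pool of 100,000 placeholder domain names (not real data).
pool = [f"domain{i}.example" for i in range(100_000)]

# One independent sample per month; the seed value is an arbitrary label.
sample = monthly_sample(pool, k=100, seed=1)
print(len(sample), sample[:3])
```

Drawing a fresh sample each month keeps any one hand-picked set of domains from dominating the comparison.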

We were looking at the number of referring domains and the total number of backlinks each contestant had for the 100 domains.

Next, for each domain in the test sample, we calculated the ratio of the Semrush result to each competitor's result. A ratio below 1 means the Semrush database has less information for that domain; a ratio above 1 shows how many times larger the Semrush result is than the competitor's.

To get the final score, we calculated the median of all results.
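The scoring described above can be sketched in a few lines of Python. The counts are invented for illustration, the function name median_ratio is hypothetical, and skipping domains where the competitor reports zero is an assumption, since the article does not say how such cases were handled.

```python
from statistics import median

def median_ratio(ours, theirs):
    """Median of per-domain ratios ours / theirs.

    Domains where the competitor reports 0 are skipped (an assumption;
    the ratio is undefined there).
    """
    ratios = [o / t for o, t in zip(ours, theirs) if t > 0]
    return median(ratios)

# Hypothetical referring-domain counts for five test domains (not real data).
semrush_counts = [120, 80, 300, 50, 10]
competitor_counts = [100, 100, 150, 50, 40]

score = median_ratio(semrush_counts, competitor_counts)
print(score)  # 1.0 means the two indexes are on par for the median domain
```

Using the median rather than the mean keeps a single domain with an extreme ratio from skewing the final score.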

As expected, we’ve had a lot of feedback on this post, and we would like to thank you all for your responses! 

For the most part, the community was very supportive, and one of the first people to give us kudos was Aleyda Solis.

This was followed by a barrage of positive messages, with Gregg Lee putting the cherry on top — Brian Dean has checked and approved our database growth.

Of course, we’ve also had a good share of criticism. Russ Jones claimed to prove us wrong with his own research.

After a bit of back and forth, he revised his conclusion. But we still have to disagree with it.

To quote Russ: “Comparing link indexes accurately is no easy endeavor.”

That’s completely true. First off, getting a truly random sample of domains is an important and very complex part of a quality backlink index comparison. We really appreciate the methodology that Russ presented in his article; it’s quite a helpful piece. Yet we cannot agree with the way he assesses and compares the indexes themselves.

The method he uses only shows the likelihood of one index having more data than the other. It does not reveal how much more data there is, which means the method cannot be used for a real comparison (if your goal is to find out which index has more data).

The following are examples illustrating why. 

Example 1:

Let’s say we have backlink indexes for Contestant 1 (C1) and Contestant 2 (C2). 

The comparison for a sample of 12 domains shows that C2 wins every time:

According to Russ’s approach, C2 is the absolute champion. But in reality, the difference between indexes is 0.1%, which means that the C2 and C1 indexes are basically equal. 

Example 2:

This time, let’s say, the comparison shows that out of a sample of 12 domains, C2 has 9 wins, and C1 has 3 wins:

Once again, according to Russ’s approach, C2 here is 3 times bigger than C1. But by looking at the actual index sizes, you can see that for 75% of sample domains the indexes are almost equal (0.1% difference), and for 25%, C1 has a complete victory (C2 has no data). Overall, C1 in this example has a better backlink index.

These examples are extreme, but they do illustrate the flaws of the approach. Without knowing how much data there actually is, you cannot claim that one backlink index is more useful for SEO than another.
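Example 2 can be reproduced with a short Python sketch. The backlink counts below are invented to match the proportions described (nine domains with a 0.1% gap, three where C2 has no data); they are not taken from any real index, and count_wins is a hypothetical helper.

```python
def count_wins(c1, c2):
    """Return (C1 wins, C2 wins) over paired per-domain counts."""
    c1_wins = sum(a > b for a, b in zip(c1, c2))
    c2_wins = sum(b > a for a, b in zip(c1, c2))
    return c1_wins, c2_wins

# Invented backlink counts for 12 sample domains (not real data).
c1 = [1000] * 12             # C1 has data for every domain
c2 = [1001] * 9 + [0] * 3    # C2 leads by 0.1% on nine domains, has nothing on three

c1_wins, c2_wins = count_wins(c1, c2)
c1_total, c2_total = sum(c1), sum(c2)

print(f"wins:   C1={c1_wins}, C2={c2_wins}")
print(f"totals: C1={c1_total}, C2={c2_total}")
```

Counting wins puts C2 ahead 9 to 3, while the totals show C1 holding roughly a third more backlinks across the sample, which is the gap that a win count alone hides.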

Our comparison method acknowledges this, as we were calculating the median using the actual numbers of referring domains and backlinks.

Russ kindly shared the sample used in his research so that we could verify it ourselves. The results turned out to be very similar to those presented in our research.

In terms of the number of backlinks, this graph shows a drastically different picture from what Russ presented in his research.

We also took the first domain (amotherthing.com) from Russ’s sample and ran it through the interfaces of both Semrush and Moz. 

The numbers proved to be different from those presented by Russ.

Russ’s research:  
Semrush: 28469 backlinks
Moz: 404078 backlinks

Tool interfaces:
Semrush: 37.7k backlinks
Moz: 26.8k backlinks

Anyway, we wanted to thank Russ for his time and ideas, as we believe that healthy competition is a good incentive for us and the industry as a whole.

We have made a huge leap forward with our backlink database, and it feels great to look at the numbers and pat ourselves on the back, but, obviously, it is not just about the numbers.

The quantity of data does not necessarily translate into quality, and we are making a great effort to ensure that our database stays fresh and useful.

Now that we have a new data-gathering process, we will build upon it, designing new features and capabilities that will make our tools even stronger. Stay tuned for more exciting news!
