Article 68EQK Massive Yandex Code Leak Reveals Russian Search Engine's Ranking Factors

by janrinok from SoylentNews on (#68EQK)

Freeman writes:

https://arstechnica.com/information-technology/2023/01/massive-yandex-code-leak-reveals-russian-search-engines-ranking-factors/

Nearly 45GB of source code files, allegedly stolen by a former employee, have revealed the underpinnings of Russian tech giant Yandex's many apps and services. The leak also exposed key ranking factors for Yandex's search engine, the kind almost never made public.
[...]
As detailed by Buraks (in two threads), Yandex's engine favors pages that:

  • Aren't too old
  • Have a lot of organic traffic (unique visitors) and less search-driven traffic
  • Have fewer numbers and slashes in their URL
  • Have optimized code rather than "hard pessimization," with a "PR=0"
  • Are hosted on reliable servers
  • Happen to be Wikipedia pages or are linked from Wikipedia
  • Are hosted or linked from higher-level pages on a domain
  • Have keywords in their URL (up to three)
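
To make the URL-related factors in the list above concrete, here is a minimal Python sketch of how such signals could be measured. It is purely illustrative and not taken from the leaked code: the function name `url_features`, the tokenization rule, and the returned dictionary are all assumptions. Only the ideas of counting numbers and slashes and of crediting at most three keywords come from the list.

```python
import re
from urllib.parse import urlparse


def url_features(url, query_keywords):
    """Toy URL signals loosely modeled on the listed factors.

    Hypothetical helper for illustration; not Yandex's implementation.
    """
    path = urlparse(url).path

    # "Fewer numbers and slashes in their URL": count both in the path.
    digits = sum(ch.isdigit() for ch in path)
    slashes = path.count("/")

    # "Keywords in their URL (up to three)": split the path into
    # lowercase alphabetic tokens and count distinct query keywords
    # that appear, capped at three (the cap is the only part stated
    # in the list; the matching rule is an assumption).
    tokens = set(re.findall(r"[a-z]+", path.lower()))
    wanted = {kw.lower() for kw in query_keywords}
    matched = len(wanted & tokens)

    return {
        "digits": digits,
        "slashes": slashes,
        "keywords_in_url": min(matched, 3),
    }


if __name__ == "__main__":
    print(url_features(
        "https://example.com/blog/2023/01/yandex-code-leak",
        ["yandex", "leak", "ranking"],
    ))
```

How a real ranker would weight these counts against the other factors (age, traffic, server reliability, Wikipedia links) is not described in the summary, so the sketch stops at feature extraction.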

I'm not sure how much these differ from the ranking factors of our own search engines. Does anyone have any insights?
