MGC Spam Filtering

In today’s edition of the Bing Index Quality blog we will delve into one particular spamming technique: MGC (short for ‘machine generated content’). We will discuss what it is, why and how spammers employ it, and introduce a specific update we shipped a few months ago aimed at detecting and filtering out pages that use this technique. What is MGC, and why and how do spammers employ it? As we mentioned in the Web Spam Filtering overview blog from...

Blame The Meta Keyword Tag

I blame the meta keywords tag. That little so-and-so started all this. Well, the “tag” and a few crafty humans, really. The idea was pretty simple. If the keyword appeared in the meta keywords tag, the page was relevant to the topic. And, on the surface, this was a solid idea. Something useful to the systems, yet invisible to the average person. Alas, with a simple right-click, the beginnings of modern-day search optimization were born. “What if...

Extrapolating Malware Detection with Rollup

Protecting Bing users from malware is a top priority for the Index Quality team. To that end, we analyze every signal available to us and determine not only whether the page is infected, but also whether it runs a high risk of infection at a future date. One of the key elements of this analysis is discovering clues about potential vulnerabilities on the ‘container’ hosting the page that could be exploited by malware distributors to spread their...

URL Keyword Stuffing Spam Filtering

As we alluded to in last week’s Index Quality blog, today’s update will focus on one specific spam filtering mechanism we rolled out a few months ago that targets a common spam technique known as URL keyword stuffing (KWS). What is URL KWS? Like any other black hat technique, the goal of URL KWS, at a high level, is to manipulate search engines into giving the page a higher rank than it truly deserves. The underlying idea unique to URL KWS relies on...

Web Spam Filtering

As I mentioned in the July 15 blog introducing Bing Index Quality, one of the key dimensions of our work is web spam detection and filtering. The overview of our approach to this complex problem will be the focus of today’s update. What is web spam? On the surface, our definition is fairly straightforward and intuitive. We think of a webpage as spam if its owner uses black hat SEO techniques in an effort to game our search algorithms with the goal...

Resetting Expectations - What Business Today Needs To Learn

Today’s new businesses have much in common with their older counterparts. The need to acquire new leads, drive revenue, market their product or service. The need to manage overhead, do more with less and stay one step ahead of the competition. So from, say, 1974 to 2014, some things haven’t changed much. One thing you never started a business without “back in the day” was a business plan. And that same thinking should be in place today, but in a...

Bing-Inc Roadshow Final Stop: Houston

The Bing/Inc Roadshow is coming to an end. It’s been a jam-packed tour over the last four events in Nashville, Chicago, Atlanta and Boston, with capacity crowds, excellent questions and easy access to answers. The panel, with industry heavyweights such as Maisha Walker of Message Medium, Bruce Clay of Bruce Clay Inc, Marty Weintraub of aimClear (and myself), is set to meet up again during the final roadshow event in Houston, Texas. It promises to be...

Bing Site Safety Page

Malware, and the possibility of infection, is an unfortunate reality facing internet users today. At Bing, we take our job of providing a safe searching experience to our customers very seriously, and this anti-malware effort is one of the core elements of the Bing Index Quality charter. We’ve been in this game for many years and have developed comprehensive solutions that minimize users’ risk of infection and maintain the integrity of our index...

Usability, Content and Calendars: 3 Areas To Understand And Focus On

For every business online comes an almost limitless number of areas to learn about, become proficient in and continually work on. Many people end up good in an area because repetition and the “school of hard knocks” teach them a workable path. And while some areas are no-brainers (knowing your product, for example), they still can present legitimate challenges to a business. Areas like Usability often get talked about, but are just as often...

Filtering Low Quality Links in Bing SERP

The internet can often feel like a giant cesspool of low-quality, illegal or malicious documents. The mere mention of malware, adware and viruses is enough to send even the most experienced internet surfers running for cover. And yet, these represent just the tip of the iceberg of poor-quality documents. Searchers also have to deal with phishing sites, fraudulent sites or sites propagating scams, spammy sites... Well, you get the picture. On top of...