Artificial Intelligence | News | Insights | AiThority

Thwarting Digital Ad Fraud at Scale: An Open Source Experiment with Anomaly Detection

Ad fraud continues to be a thorn in the side of digital advertisers, as bot traffic and fraudulent human activity falsely inflate ad statistics. Such activity forces unwitting brands to shell out for clicks or impressions that have no hope of reaching – let alone converting – potential customers. A recent study by White Ops and the Association of National Advertisers (ANA) found that advertisers lost around $5.8 billion to ad fraud last year. On the brighter side, the study also found that efforts to defeat ad fraud – using techniques such as Anomaly Detection – are now more successful than ever. In fact, ad fraud would have cost advertisers more than $14 billion in 2018 if not for improved anti-fraud initiatives. While the study estimates that digital ad fraud attempts currently represent 20-35% of all ad impressions, the amount that gets through (and which advertisers actually pay for) is significantly smaller due to improved mitigation.

But there is still a long way to go, and non-human traffic remains one of the most widespread sources of digital ad fraud. In an increasingly common practice, bad actors enlist botnets to execute fraud by directing an army of connected devices to visit sites and peck away at ads. In other cases, click farms pay real people to produce clicks and impressions with no intention of considering the ad content.

Still other ad fraud scenarios involve non-viewable ads or ads that honest users cannot interact with correctly, such as ads that cannot be closed. Importantly, each of these ad fraud varieties produces anomalous data that Anomaly Detection systems can recognize in order to impede fraud and protect advertisers’ bottom lines.
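The article does not describe the detection algorithm the experiment used, so the following is only a minimal sketch of what "recognizing anomalous data" can mean in practice. It assumes a simple z-score test over a hypothetical series of clicks-per-minute counts for one ad placement; the function name, the threshold of 3.0, and the sample numbers are all illustrative assumptions, not details from the experiment.

```python
from statistics import mean, stdev


def is_anomalous(history, latest, threshold=3.0):
    """Return True if `latest` lies more than `threshold` sample standard
    deviations away from the mean of `history` (a basic z-score test)."""
    if len(history) < 2:
        return False  # not enough data to estimate the spread
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # A perfectly flat history: any different value counts as unusual.
        return latest != mu
    return abs(latest - mu) / sigma > threshold


# Hypothetical clicks-per-minute counts for a single ad placement.
clicks_per_minute = [12, 9, 11, 10, 13, 8, 11, 10]

print(is_anomalous(clicks_per_minute, 11))   # a typical minute
print(is_anomalous(clicks_per_minute, 240))  # a sudden bot-like burst
```

A production system would likely use richer signals (per-device and per-IP behavior, viewability, time-of-day patterns), but the shape is the same: compare each new event against recent history and flag large departures.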

All that said, the vast scale of advertising data generated through websites and ad networks adds a monumental data challenge to the task of implementing an ad fraud Anomaly Detection system, one that requires appropriate computational, scalability, and performance capabilities. Instaclustr, the company I work for, doesn’t sell ad fraud solutions, but we recently completed a purely experimental Anomaly Detection application that shows how scalable open source technologies might be able to spare advertisers from the costs and harm of ad fraud.

To achieve the requisite capabilities while keeping the experimental solution cost-effective for practical usage, our test system used an architecture composed of open source Apache Kafka, Apache Cassandra, and our Anomaly Detection application. Beyond the performance, scalability, and affordability that Kafka and Cassandra provide, the two open source data technologies also offer a particularly high degree of compatibility and pair well together.

Our experiment assembles Kafka, Cassandra, and our Anomaly Detection application in a Lambda architecture, in which Kafka and our streaming data pipeline are the speed layer, and Cassandra acts as the batch and serving layer. In this configuration, Kafka makes it possible to ingest streaming digital ad data in a fast and scalable manner, while taking a “store and forward” approach so that Kafka can serve as a buffer to protect the Cassandra database from being overwhelmed by major data surges. Cassandra’s strength is in storing high-velocity streams of ad metric data in its linearly scalable, write-optimized database. In order to handle automation for provisioning, deploying, and scaling the application, the Anomaly Detection experiment relies on Kubernetes on AWS EKS.
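The "store and forward" idea can be illustrated without any real Kafka or Cassandra client. The sketch below is a toy simulation, not the experiment's code: an in-memory queue stands in for Kafka, a fixed per-tick write capacity stands in for the database, and the function name and numbers are hypothetical. It shows how a burst of arrivals is absorbed by the buffer while the downstream writer never exceeds its capacity.

```python
from collections import deque


def simulate_buffer(arrivals_per_tick, write_capacity_per_tick):
    """Simulate a FIFO buffer in front of a rate-limited writer.

    Returns (writes_per_tick, backlog_per_tick). The simulation keeps
    running after arrivals stop until the backlog has fully drained.
    """
    if write_capacity_per_tick <= 0:
        raise ValueError("write capacity must be positive")

    buffer = deque()          # stands in for the streaming buffer
    writes, backlog = [], []
    next_event_id = 0
    tick = 0
    while tick < len(arrivals_per_tick) or buffer:
        arrivals = arrivals_per_tick[tick] if tick < len(arrivals_per_tick) else 0
        for _ in range(arrivals):
            buffer.append(next_event_id)  # "store"
            next_event_id += 1

        # "forward": the writer takes at most its capacity each tick.
        written = min(write_capacity_per_tick, len(buffer))
        for _ in range(written):
            buffer.popleft()

        writes.append(written)
        backlog.append(len(buffer))
        tick += 1
    return writes, backlog


# A quiet tick, a surge, then quiet again; the writer handles 4 events per tick.
writes, backlog = simulate_buffer([2, 10, 1, 0], write_capacity_per_tick=4)
print("writes: ", writes)   # → [2, 4, 4, 3]
print("backlog:", backlog)  # → [0, 6, 3, 0]
```

The surge of 10 events never reaches the writer all at once: it is spread over the following ticks, which is the protective role the article attributes to Kafka in front of Cassandra.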

In the end, the experiment was a successful one. The Anomaly Detection application demonstrated the ability to process 19 billion real-time data events in a day, likely meeting the ad fraud detection needs of even the largest brands. To reach these results, we scaled the application from an initial three Cassandra nodes all the way out to 48. At the same time, the experiment made use of 574 CPU cores, counting all Cassandra, Kafka, and Kubernetes clusters. The experimental application proved capable of maintaining a peak of 2.3 million writes per second into Kafka, along with a sustainable rate of 220,000 anomaly checks every second.
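The daily figure follows directly from the sustained per-second rate. The short calculation below uses only the numbers quoted above (the variable names are my own) and also shows how the peak Kafka write rate compares with the sustained check rate.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400

checks_per_second = 220_000          # sustained anomaly checks (from the article)
peak_writes_per_second = 2_300_000   # peak writes into Kafka (from the article)

checks_per_day = checks_per_second * SECONDS_PER_DAY
print(f"{checks_per_day:,} checks per day")  # → 19,008,000,000 checks per day

ratio = peak_writes_per_second / checks_per_second
print(f"peak write rate is about {ratio:.1f}x the sustained check rate")  # → about 10.5x
```

So 220,000 checks per second sustained over a full day is just over 19 billion events, which matches the headline number.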

By teaming open source data-layer technologies like Kafka and Cassandra and making the most of the intrinsic benefits each has to offer, this experiment demonstrates a method that advertisers and ad networks can adapt to their own needs: a path to affordable, scalable, high-performance Anomaly Detection applications that protect the integrity of the ad metrics they pay good money to achieve.
