Hate Speech decreases finally?


No, there was no decrease of hate speech in the German social media buzz during the last 6 months. As already seen during the last 2 years, the largest number of hate speech is found in twitter (70%), followed by facebook (10%), blogs and forums.

I have developped a detector which finds public hate speech posts in these pagetypes. Read here, how this detector works.


Here are the data the detector produced for all 4 page types from May to October 2017:

The peaks mark terroristic attacks during this summer. We had a knife attack July 28th in Germany and the Barcelona-car on August 20th.

Due to the dominant role of twitter in the diagram above, Facebook’s performance is somewhat flattened. Here is the diagram only for Facebook:

A decreasing line looks different.

Now, before you interpret and use some of the numbers shown in the diagrams, you should read however the following paragraph.


The detector works very simple, using a common social media monitoring tool. The queries for the detection-research look for posts containing “hate-words”. Hate-words are swearwords which mean people give to others, who they hate, to muslims, blacks, jews, foreigners and so on. You know some of these words of course, I don’t have to list them here. They are all ugly.

Now when a post contains such a hate-word, the post is counted as hate speech.

Does that cover all posts in question correctly? No, it does not.

  • The detector finds post which are no hate speech. For instance if I had given an example of a hate word in this post, this very post would count. It contains one of the ugly words.
  • The detector does not find true hate speech. The phrase: “One should really hang all xyz!” contains no hate word (as long xyz itself isn’t one). Hence, when looking through the detector’s lens, this phrase is no hate speech.

So the real volume of hate speech may very well be quite off the number we have found with the little detector here.

But how relevant is this absolute volume? Much more important to me is, how this number develops. It is quite obvious that its curve will have a very similar shape as the one of the detector. When the true hate speech volume increases the detector will show an increase too – and vice versa.

More information

I have separated the queries by “peer-group”. One query f.i. is called “jew-enemy”, another one “muslim-enemy”. The curves for these queries are not completely parallel and hence provide further interesting insights.

Since the detector now runs since more than two years, I have collected some data from that time. It is avalaible in principal.

Contact me if some of these data are of any interest.


Software-Deficits Decreasing. Hurrah!?

Good News?

In the beginning of this year we were pleased to read: the number of worldwide documented software-deficits went down in 2016 compared to 2015: from 6,400 to 5,600, more than 10% minus! The source was the Hasso-Plattner-Institut in Potsdam near Berlin, Germany.

One month later it seems that some more bugs have been reported and hence the decrease turned into a little increase by then. (yellow = little deficits, red = severe deficits). Too bad.

Software-deficits reported by Hasso-Plattner-Institut on 02-18 / 17

Poor quality anyway

However: Some 100 vulnerablity-deficits more or less seem not so important to me. What really counts is the absolute number of them: 6,500!

The red part of the column display the number of severe deficits – nearly 2,500!

If we’d talk about cars, in these models the breaks wouldn’t work or the steering would block. Thousands of fridges would defrost over night and many nuclear plants would emit much too much radiation. Why is that not the case? These are technologies with a zero-bug-philosophy. Zero-bug is the standard-orientation for any educated engeneer.

But not in the software business. Since more than 30 years we have gotten used to the fact that software-development is a multi-bug-technology. Explicitely and unscrupulously. The patch- and bug-fixing-plans are even part of the marketing-strategy.

It is true: cars, fridges and even nucear plants have problems once in a while. We know product recalls and we know Tschnernobyl and Harrisburg. No fun at all for the management. But: they did not plan it that way!

For software-developers the green banana is the standard business case.

And it’s true too: One reason for this situation is the enormous price-pressure in the IT-business, and in the end that means: we ourselves are responsible. But are we truely more generous and less parsimonious when buying a car or a fridge?

The underlying metric of the statistics above is the CVSS-Index. It measures only vulnerability, “hackability” in a way. The mere functionality (does it what it shall do?) is not concerned. We probaply could double all numbers here if we were to evaluate the deficits in the understanding of some “holistic quality”.

The true reason

I believe the reason behind this absurd situation is that we do not have any official licensing-process for software. Anybody can claim to be a developer and sell his software-products. Thinking of our faster and faster growing dependency of digital environment, this is really wantonly negligant. But which semi-official institution is daring to put up with such a challenging task?

The difficult Germans are not known as first line fans of the digital world. They have been for long and still are quality-fans. The multi-deficit-culture of the software industry surely is one of the reasons for the German’s reserve.

Google Trends visualized – what for?

Recently I stumbled upon this psychedelic scheme:

What is it?

You see the top 25 search-phrases on Google in a near-realtime-visualization. Google calls these labels, names, words or phrases “trends”. Well, maybe they are.

Here is how Wikipedia explains Google Trends: Google Trends is a public web facility of Google Inc., based on Google Search, that shows how often a particular search-term is entered relative to the total search-volume across various regions of the world, and in various languages…..”

What is it for?

Good question. First of all it is coloured and dynamic. Eye-catchy in a way.

It demonstrates which subjects, issues, concerns, hypes, interests and questions are right now current – somewhere on the world. And by this it shows (to me at least) how small my perspective, how narrow my angle is. I do not know half of these words or names, many of them I even cannot read or pronounce. And if by chance I do know a name or word here and there however, in most cases I have no idea, why this one pops up for some seconds and then declines again.

In other words: the world and I do not follow the same trends. So this colourful map tells me to picture myself at the right place: somewhere way out, far off the center.

How does it work?

You like this flashing picture? You’d like to have it on your site? Here is how the psychedelic chessboard comes to your webpage:

This is the central line:


It has to be embedded into your html-code, using the “iframe” command. You might like some calibration for the size too.

So altogether in WordPress (this is WordPress) the full line is:

<iframe src=”http://www.google.com/trends/hottrends/visualize?nrow=5&amp;ncol=5&amp;pn=p1/” width=”918″ height=”576″></iframe>

Copy it and paste it into your content, where it shall appear. That’s it.

Something else?

These coloured trends obviously do not lead to deep insights, they are rather entertaining. This site is based on the same data but a bit more serious. Here you can apply filters for specific regions and subjects. This might suffice as smalltalk-preparation for an international dinner party.

If you are really after the data for your own analysis of trends ands topics, you have here the API, Google’s raw-data-station.

Digital Media Time: Over 50% on Smartphones

Digital media is mobile

Following a recent research-study by comScore every second minute on digital media ist spent on a smartphone.

Social Media, Kugel, 3D, Web 3.0, Text, Bild, Audio, Video, P2P, digitale MedienIf you add tablets the mobile share reaches even 68%, two thirds.

Apps dominate the internet

The large majority of these mobile minutes (85%) is spent on apps, crude web-usage only has a share of 15%.

So the mobile trend is still ongoing. The authors of the study remark however that the old-school PC will keep its relevant role for a number of purposes however.