
The Filter Module is a very powerful module that lets you filter certain data in or out of your Yahoo Pipe. A filter module can contain multiple rules that you specify to block or permit items.
First select whether you want to block or permit items. Then whether you want to match all, or any of the rules you specify.
To create a rule, select an attribute from the dropdown box (ie author, title, description, link) that either:
- Contains
- Does not contain
- Matches regex
- Is greater than
- Is
- Is less than
- Is after
- Before a query
…that you specify.
In Yahoo Pipes, the Filter Module is great for blocking duplicate posts if you have mashed up similar RSS feeds, or only allowing news stories about specific topics.
In this example, lets take a look a look at some cars to fix up and sell. First we can grab a Craigslist feed of cars + trucks in the Vancouver area priced between $1 – $1000. Lets look at every car except Volkswagens (sorry VW fans). We don’t want to see VW listings in our feed, so lets block them out entirely with the Filter module.
First we want to tell the Filter module to “block” items that match “all” of the following:
“Description” “contains” “vw”
“Description” “contains” “Volkswagen”
This ensures that any Craigslist listing with the words “vw” or “Volkswagen” won’t show up in our feed.
We can block or permit items. Match any or all rules. And match items that:
- Contain
- Does not contain
- Matches regex
- Is greater than
- Is
- Is less than
- Is after
- Is before a string
Strings can be entered directly into the Filter module, or wired in from a user input module.
The left side of the rule box contains a drop down list of attributes in our feed to select. This allows us to match rules against different attributes in our feed such as our title, description, link etc.


I am trying to search and filter a large number of RSS feeds for exact words such as “car” and have been using filter operator combined with ‘contain’ but I notice that I also get variations on “car” such as “cart”, “cartoon” … in my results, have tried ‘is’ but get no results. Is there a way to filter for the exact word?
That is nice, but I find it hard to believe that there is no way to input multiple terms WITHOUT opening new “filters”
For example , let´s say I am aggregating news feeds from 6 newspapers.
I want :
Contain “war” OR “football” OR ” “Stocks” OR “Hong Kong” OR “New York” OR “FASHION” (etc …plus 40 more terms)
but NOT contain “Germany” OR “Diamonds” OR “China” OR “Brooklyn” (etcc…)
There is no way to use the FILTER inside a loop, and the tokenizer will not connect to the filter text field .
That means that if I KNOW the terms I want and do not want , I need to open fields for each ? there is no way to input a long string ??
(for example i want only news of football but NOT from germany or china ???