Useful Pipes for the Lazy Man’s Job Search
admin
Like 350.000 of my dear countrymen I’m currently searching for a job. This translates to: reading job-advertisements, applying for jobs, hoping for the opportunity of a job-interview, performing as good as possible in the interviews and hoping to finally get a decent job. The process is lengthy and repetitive but actually not boring. I always feel that I have learned something about myself after each job-interview. Most of the jobs which are relevant for me are posted on job-pages like monster.at and are delivered to me via email. But some of the companies (mainly the big ones) don’t post all their vacant jobs their. They have a section on their web-site (e.g. the APA career page) where they maintain their open positions. So if you want to know if any of these companies has announced a new open position on their web-site you have to check it regularly. This task is quite boring, time consuming and seldomly leads to new insights (companies don’t announce new jobs that often). In other words it’s a waste of time.
I was actually quite surprised that even though most of the big companies that maintain their own career pages use state of the art content management systems, none of them thought it could be useful for job-searchers to provide an RSS feed. RSS feeds are convenient because if you subscribe to them they keep you informed about updates of a web-site (without you having to read/access it directly). So I was thinking if it’s possible to automatically generate RSS feeds from such career-pages with very little effort (i.e. no programming). The first thing that came to my mind was to use Yahoo Pipes. To be honest, it always tempted me to do something with Yahoo Pipes, but I never had a specific problem at hand to use them for.
The Yahoo Pipes project was launched in 2007 as an effort to help internet-users adopt (manipulate) the content of web-pages to their needs without requiring them to to know or learn a programming language. Yahoo Pipes lets you pick from several input sources like a web-page or a google spreadsheet (CSV) document and run their content through several processing elements that filter and/or change the content. The processing elements of a pipe are put together (wired) using a graphical editor. Each pipe element has its special attributes that can be customized by the pipe-creator. The output of every pipe is an RSS feed which can be subscribed to using a feed aggregator like Google Reader or a web browser like Firefox.
The following picture shows a pipe that generates an RSS feed for the APA career page mentioned above. The first processing element (upper left corner) is “Fetch Page” where the URL of the career-page is specified. Further the markers are specified which tell the processing element which part of the web-page (the HTML code) to take. Also a delimiter is specified that seperates the continous input into seperated items (the open positions).
The second processing element “Regex” performs some cleanup in the HTML code of the seperated items. It deletes image tags and line feeds. Until this point each item consists just of one field “content” that contains the extracted HTML code from the web-page. The “Rename” processing element makes two copies of the “content” field (date and jobtitle) and renames the original field to joblink. Another “Regex” processing element does the main work. It extracts the date of the “date” field (for each item) and the name of the job offer as well as the link to it. All the processing elements until “Create RSS” fullfill the purpose of deleting the first and the last item of the data stream. “Create RSS” does some renaming of the fields in the items. The output of this pipe looks like this:
And if you subscribe to it using Google Reader like this:
As you can see in the picture above I also created pipes/feeds for:
- Bundesrechenzentrum
- Raiffeisen Zentralbank
- Raiffeisen IT
- Wien Energie
- Österreichische Kontrollbank
- Österreichische Nationalbank
So instead of having to go to all of these web-pages seperately and check if there’s an update, I just check the Google Reader which tells me this information in a fraction of the time. You’re invited to use these pipes and alter them.
Good luck with your job search!
Posted in Uncategorized, computer science, data mining, music, sports, work |
No Comments »














