13 Jul 2018

Freelance Regex enthusiast needed for summer job

MoritzH Anywhere

Job Description

Assist in digitising books of German companies from the 1990ies. These books are firm registers with basic information about firm location, production and key numbers from the balance sheets. Raw data is obtained from runnning high-quality scans through OCR softwares. The job consists of turning these raw text files into structured data!
Requirements:
• Solid proficiency of regular expressions in R or Python
• Basic knowledge with Record Linkage packages
• Experience of collaborating on code
• Willingness to wrestle with messy data
• Availability to skype on regular basis
• Confidential handling of data
• Basic German skills are a plus
Recompensation:
You should be available for at least 40 hours until the beginning of October 2018. The salary is 10 ∗ c (euros/hour), where c ∈[1,2] adjusts for your relevant experience. Payment is preferrably in-kind (i.e. electronic devices, travel expenses etc).

How to Apply

Please send a recent coding sample of yours in R or Python, two lines on your academic background and two lines on your favourite city in Germany (possibly in German) to this email address: [email protected]
Most importantly, please include a regex suggestion in the email to extract zip codes and town names from this string in R:
towns <- c("Home is 1050 Brussels", "been to 1050 Sankt Moritz ? or 98a3 Frankfurt am Main", "nice place 4123 Frankfurt/Oder.", "bad wather: 1234 Bad-Godesberg.")
Thank you!

Job Categories: Lecturer/Teacher. Job Types: Freelance. Job Tags: record linkage, regex, and strings. Salaries: Less than $100,000.

Job expires in 27 days.

Apply for this Job