Pablo Hoffman
Curriculum Vitae
Updated: December, 2011
Contact information
| Full name | Pablo Hoffman |
| Address | Not included in the online version of this CV |
| Telephone | Not included in the online version of this CV |
| E-mail | |
| Jabber/GTalk | |
| Skype | pablohoffman |
| Web | http://pablohoffman.com |
Personal information
| Nationality | Uruguayan |
| Resident of | Montevideo, Uruguay |
| Birth date | 7 Jan, 1982 |
| Marital status | Single |
| Languages | Spanish (native), English (fluent reading, speaking and writing) |
Education
B.S. in Electrical Engineering (Electronics profile)
Universidad ORT
Montevideo, Uruguay
Admission in 2000
Graduation in 2006. 83% GPA
Work experience
| Period | 2010 - present |
| Company | Scraping Hub |
| Business | Web crawling and screen scraping startup |
| Web | http://www.scrapinghub.com |
| Email | |
| Position/roles | Co-founder |
| Period | 2007 - present |
| Company | Insophia |
| Business | Information Technology and Services |
| Web | http://insophia.com |
| Email | |
| Position/roles | CTO, director & co-founder |
Extended work experience not available in the online version of this CV.
Teaching experience
| Period | 2006 - 2007 |
| Institution | Universidad ORT Uruguay |
| Courses | Computer networks |
Personal projects
Scrapy
I founded (and currently lead) Scrapy, an open source web crawling & screen
scraping framework for Python, available at http://scrapy.org
PC-based oscilloscope
- Based on the Microchip PIC18F4550 microcontroller
- GPL licensed
- Hardware specifications and source code available at http://pablohoffman.com/oscusb/
Open source news aggregator
- An aggregator for news sites which uses feed readers and web scraping techniques
- Similar to Google News, but consisting of several custom-written crawlers for each site
- Intended to force local media sites to adopt web standards
- Written in PHP+Python (version 1.0) and Django+Python (version 2.0)
- Source code available at http://code.google.com/p/noti/
- Production site at http://notiuy.com and http://beta.notiuy.com (version 2.0)
Network-based Linux block device
- Linux module implementing a block device with remote storage over a TCP connection
- Source code will be released under GPL once the college semester is over (March, 2007)
Personal interests
- Writing - blogging since Jul. 2005 (www.vampiroz.org)
- Reading - mostly fiction, science and technical literature
- Open source development
Experience with computer technologies
Information Technology
- Web server administration using Apache HTTP server
- Mail servers (MTA) using qmail and Postfix
- Database server administration (MySQL and PostgreSQL)
- DNS servers with master/slave redundancy using BIND 8 and 9
- Design and deployment of LDAP directories and authentication backends using OpenLDAP
- VPN design and installation using OpenVPN
- VoIP solutions using the SER (www.iptel.org/ser) software and Sipura (www.sipura.com) or Cisco ATA devices
- Linux cluster deployment using DRBD (www.drbd.org) and Heartbeat (www.linux-ha.org)
- Server monitoring systems using Zabbix (www.zabbix.org) and Bigsister
- Surveillance over IP using self developed software, network cameras, Geovision video server (www.geovision.com.tw), and Motion (motion.sourceforge.net)
- Network Firewalls with bandwidth control using Linux (iptables/shorewall/tc), and FreeBSD (packet filter/altq)
- Jabber (instant messaging) servers using jabberd2 and ejabberd
- Helpdesk (ticket system) deployment using OTRS (www.otrs.org)
- Collaborative wiki platforms using TWiki (www.twiki.org) and MediaWiki (www.mediawiki.org)
- Centralized backup systems using Bacula (www.bacula.org)
- Hard disk redundancy using RAID (Linux) and geom (FreeBSD)
- Source Code Management for tracking software projects using Trac (trac.edgewall.org)
- Blogging platform multi-site deployment using WordPress (www.wordpress.org)
- Anti-spam control using SpamAssassin, DSPAM (dspam.nuclearelephant.com), and RBL checks
- Anti MSN/P2P control using Squid and firewall blocking
- Automatic Windows PC deployment using open source tools (parted, partimage)
Software development
- Hadoop, a framework for distributed data processing using the MapReduce paradigm
- Xapian, a search engine library
- Web development using PHP/MySQL, mod_perl and Django/PostgreSQL (Django is an MVC web framework for Python similar to Ruby on Rails, www.djangoproject.com)
- Web design following W3C standards (XHTML, CSS, DOM, ECMAScript)
- Low-level microcontroller programming for Microchip PIC using assembly and a reduced C compiler
- Linux internals: developed a kernel module for a network block device
- Multi-platform GUI application development using Python and wxWidgets
- Regular expressions for text processing and parsing
- Inter-language bindings (in particular, Python-to-C bindings)
- Version control systems: proficient in Mercurial and SVN
- Good coding style devotee
Computer languages
- Proficient in: Python, SQL, XML/HTML/CSS, shell scripting
- Good (but rusty) experience in: C, Java, PHP, Perl
- Learned in the past but forgotten: C++, Assembler (8086), Matlab, Clipper, Pascal, Modula
- Open to learning new languages when they are a better fit for the task at hand
Operating systems
- Server level: wide experience with Linux (Debian and Red Hat). I have also used FreeBSD, but I'm a bit rusty on it.
- Desktop level: wide experience with Linux, Mac OS X and Windows
- Embedded level: wide experience with OpenWrt, a Linux distribution for the Linksys WRT54G device
References
Work and personal references not included in the online version of this CV.