Jason Y. Zien
Address: Mountain View, CA 94043
E-MAIL: jasonzway-resume@yahoo.com
Phone: 650-210-0095

JOB INTERESTS
-------------
Information retrieval, contextual advertising, text indexing, search engines,
scalable systems, algorithms, performance tuning, internet technologies.

EDUCATION
---------
University of California, Santa Cruz, M.S. Computer Eng. 1993, Ph.D. 1997
Received UCSC Regents' Fellowship Award
Carnegie Mellon University, B.S. Computer Engineering, May 1991
CMU Eta Kappa Nu Engineering Honor Society

WORK EXPERIENCE
---------------
Yahoo (February 2008 - present)
Senior Scientist
Contextual advertising architecture and infrastructure development.

Kosmix.com (February 2005 - January 2008)
Member Technical Staff, kosmix.com, feedup.com, righthealth.com
Architected and implemented an inverted file (index) of the web.
Architected and implemented a real-time incremental news and blog index.
Built KosmosQ, a distributed job queueing and scheduling system.
Implemented automatic search query relevance evaluations using TRELs.
Architected and implemented a search result contextual snippet server.
Built the first version of the web front end for feedup.com in Ruby on Rails.

IBM Almaden Research Center (July 1997 - February 2005)
Research staff member (1999 to 2005), Software engineer (1997-1999)

IBM OmniFind Search Engine and Trevi (2002 to 2005)
------------------------------------------------------
Indexer team lead; designed and implemented a high performance text indexer,
improving upon the Webfountain indexer.
Research in query refinement and indexing algorithms.
Helped to deploy an IBM intranet search engine.
Helped deliver IBM's OmniFind Search Engine product.

IBM Webfountain (1999 to 2001)
------------------------------
Indexer team lead; architected and implemented a large scale text indexer and
search engine, written in C++, deployed on a cluster of Linux machines to index
several billion web pages.
Implemented a scalable, high performance document storage system for
efficiently storing and retrieving several billion documents.

IBM Cyberspace Technologies (1997 to 1999)
--------------------------------------------
Work in embedded systems, Java software development, Java performance
optimization.
Created an efficient approximate string matching algorithm for server-side
detection of errors in web URLs. This algorithm was used for fuzzy search on
OCR text for the Proceedings of the IACR CD-ROM.
Created HTMLTemplates, a Java library for generating dynamic server-side web
content.
Implemented a subset of the SMB file sharing protocol in Java.
Designed a circuit board based around a PIC 16C74 8-bit processor for a
web-enabled soda machine shown at the Sydney 2000 Olympics.
Designed a pocket-sized storage server demoed at Comdex 1999 and 2000.

Graduate Researcher, University of California, Santa Cruz (July 1991 - 1997)
M.S., Ph.D. thesis work in multi-way spectral graph partitioning.
Designed innovative new algorithms for partitioning large circuits efficiently.
Also developed `assign', a program to optimize routability of FPGAs.

Online Media (Nov 1996 - Feb 1997)
Unix software consultant.
Designed an online faxing system and dynamic web site based on an Oracle
database, which was used as an online B2B marketplace for buying and selling
advertising time on radio and television.
Implemented on Solaris; included PL/SQL programming and C/C++ cgi-bins.

Internet Media Services (October 1994 - June 1995)
Head Programmer.
Developed user-tracking software for the WWW based on URL rewriting.
Implemented libraries for interfacing MSQL databases to the web.
Designed a proprietary server-side scripting language for web sites.
C, C++, cgi-bin programming, interfacing of SQL databases to the WWW.

Summer Intern, NASA Ames, Computer Sciences Corp. (June-August 1994)
Worked on graph partitioning.
Summer Intern, Xilinx (July 1993-September 1993)
Advanced Development Group, wrote software for circuit partitioning of FPGAs.

COMPUTER LANGUAGES and TOOLS
----------------------------
Java, C++, C, SQL, HTML, Perl, Awk, CVS, Purify, Insure, Oracle, Linux, UNIX

COMMITTEES
----------
WWW2005 Program Committee Member, Search Track.
Internet Conference 2004 (IC'04) Co-chair, Search Track.

SELECTED PUBLICATIONS
---------------------
M. Fontoura, Jason Zien, R. Lempel, R. Qi, Inverted Index Support for
Parametric Search, IBM Research Report RJ10329, October 2004.

M. Fontoura, E. Shekita, Jason Zien, S. Rajagopalan, A. Neumann, High
Performance Index Build Algorithms for Intranet Search Engines. VLDB 2004.

R. Kraft, J. Zien, Mining Anchor Text for Query Refinement, Proceedings of the
13th conference on World Wide Web (WWW2004).

A. Broder, D. Carmel, M. Herscovici, A. Soffer, Jason Y. Zien, Efficient query
evaluation using a two-level retrieval process. CIKM 2003.

J. Zien, J. Meyer, J. Tomlin, J. Liu, Web Query Characteristics and their
Implications on Search Engines, Poster Paper in the Proceedings of the Tenth
World Wide Web Conference (WWW10), 2001.

J. Zien, Multi-Level Spectral K-Way Graph Partitioning. Ph.D. thesis,
University of California, Santa Cruz. June 1997.

ASSOCIATIONS: Member of the IEEE and ACM

REFERENCES: Available upon request