| [pdf version] |
Harish Doddi |
|
Tel: +1 650 391 8164 harishd@stanford.edu http://cs.stanford.edu/~harishd |
| September 2008-March 2010(Expected) |
Masters in Computer Science & Engineering Stanford University, California. |
| 2004 - 2008 |
Bachelor of Techonology (Hons) in Computer Science & Engineering International Institute of Information Technology, Hyderabad(IIIT, Hyderabad), India. CGPA 9.14/10 |
|
cs 140 (Operating Systems) |
cs 276 (Information Retrieval and Web Search) |
|
cs 245 (Database Systems Design) |
cs 345A (Data Mining) |
|
cs 244B (Distributed Systems) |
cs 240 (Advanced topics in Operating Systems) |
|
cs 347 (Transaction processing and Distributed Databases) |
cs 271 (Principles of system design in Bio-Informatics) |
|
Operating Systems |
Linux, Windows, MAC |
|
GUI Design |
QT |
|
Programming |
GNU/C, GNU/C++, STL, Java |
|
Scripting |
Python, Perl, Sed, Awk, Shell, XML/XSL |
|
Scientific |
Matlab |
|
Web Design |
HTML, CSS, PHP, CGI, Java Script, Ajax |
|
Database |
MySQL, MS-SQL, Postgres SQL |
| Internship |
Oracle Inc, Redwood Shores HQ, USA Role : Software Development Engineer Intern Group : Clusterware/RAC (Real Application Clusters) Description : As part of the internship, I worked with GIPC (Grid Inter Process Communication) layer which provided IP address abstraction to the communication layer in use. This provided high availability of the nodes in a cluster without limiting to the private network inter connects(NIC) which was in use before. The code has been included in the Oracle database 11.2g release. |
| Internship |
Amazon IDC, Hyderabad, INDIA Role : Software Development Engineer Intern Group : Transportation Core Description : As part of the internship, I developed a web GUI which involved dynamic editing based on the user given parameters. This tool helped many software engineers to make extensive use of the cgi and helped many newly joined engineers to get quickly adapted with the system. |
| R.A Ship |
Research Assistant for NMBL (Neuromuscular Bio Mechanics Lab) under Prof Scott Delp Description: Development and Investigation of algorithms for Knee Image Segmentation of MRI Images. Presented my research work at NAMIC-AHM 2009 at MIT ( NAMIC Presentation ) |
| R.A Ship |
Research Assistant for CEES (Stanford Center for Computational Earth and Environmental Science) under Prof Jerry M. Harris Description: Developed an application for enhancing Geological Data Manipulation. |
| R.A Ship |
Research Assistant for Esagu (a web-based personalized agro-advisory system which uses Information Technology to solve the unscientific agricultural practices.) under Prof Dr. P.K. Reddy Description: Developed Recommendation system for agricultural scientists in giving crop advices. |
|
December 2007 - March 2008 |
Development of Cache for Distributed Database Management System Dr. Kamal Karlapalem, CDE A Semantic Cache has been built for a distributed database management system over several machines connected via LAN. The cache increased the performance of the system several times on an average and made the query execution efficient. |
|
December 2007 - March 2008 |
Development of Distributed Database Management System Dr. Kamal Karlapalem, CDE This project aims at implementation of a distributed relational database management system over several machines connected via LAN. The DDBMS layer should support both read-only and update queries. The project proved that Distributed Database Query Processing time is efficient and reduced lot of network costs in transferring data. |
|
Summer 2007 |
|
|
December 2006 - December 2007 |
Development of a multi-class text classifier on NASA dataset B-tech Project A multi-class text classifier to categorize flight accidents data provided by NASA to help automate categorization of future flight accidents without any manual work. The major challenge in this project is that the dataset given to us is not complete. Certain data omissions have taken place in the middle for security purposes. So a normal person would not be able to understand the data. This will be further extended to work on other datasets like UIC dataset Poster |
|
November 2006 - December 2006 |
Implementation of "pText" datatype, an advanced version of Text Datatype CDE "Unstructured" text always has the limitation of its inapplicability of data mining techniques on it. Though normal databases like Postgres provides "text" datatype, it doesnt satisfy the requirements of information retrieval today. In order to overcome this limitation, I designed and implemented an advanced version of text datatype in Postgres database |
|
July 2006 - November 2006 |
A tool to summarize data distributed across machines Honors Project A tool to summarize data distribution across different machines with different parameters like no of machines, size of dataset, load per machine, etc. It gives the user the most optimal distribution and demonstrates the idea by a graph. This project is funded by Google IDC, Hyderabad. Poster |
|
Summer 2006 |
An implementation of ACME on Language Data Most of the classifier proposed performed extensive experiments to show their techniques to be both efficient and accurate. ACME ( An Associative Classifier based on Maximum Entropy) is a new classifier which is based on association rule mining. Our classifier rests on the maximum entropy principle for its statistical basis and does not assume any independence not inferred from the given dataset. An implementation of this has been tested on language data provided by Language Techonologies Research Center( LTRC ),IIIT Hyderabad. |
|
Summer 2005 |
Post-processor for Indian Language OCRs, Dr. C.V.Jawahar, CVIT A language independent post-processor for an OCR was developed and integrated with Indian language OCR system and tested in a team of three. The post-processor recieves as input the words as identified by the OCR and returns a ranked set of words that are most suitable to replace the given input if it is found to be erronous by using a dictionary. The post-processor has been tested with an Itrans plugin on two Indian languages(Malayalam and Telugu). Project Page |
|
Information Extraction and Retrieval |
This project involved the development of Named Entity Recogniser for Telugu,an Indian Language. |
|
Software Engineering |
A case study of a software development project for development of a tourist portal for France. |
|
Web Data Knowledge Management |
Implementation of paper "Combining link and Content Information in Web Search" by Matthew Richardson and Pedro Domingos. |
|
DataWarehousing and Data Mining |
Implemented the algorithm to find most frequent sequence followed by winners in a Chess Dataset. |
|
Computer Graphics |
Developed the game of tetris and implemented a graphics pipeline in OpenGl using only the in-built vertex(glVertex) primitive. |
|
Theory of Computation |
Implemented a convertor of Non-deterministic Finite State Automata to optimized Deterministic Finite state Automata. |
|
Database Management Systems |
Made a requirements analysis for a newspaper company to handle data of its classified advertisements department and designed an ER-model for Election Comission to handle database of various political parties in a decentralised system of governance and implemented a database for travel agency in MySQL to handle its resources and customers. |
|
Operating Systems |
Developed a shell with minimal functionalities and implemented a chat program using pipes. |
|
IT-Workshop2 |
Developed Intranet Quotations website which has quotations classified based on various topics for browsing, a search engine and a shopping cart developed to grab any interesting quotes and a personlised recommendation system. |
|
Basic |
Engineering Mathematics I and II, Programming, Physics I and II, Digital Logic and Design, Electronic Circuits, IT Workshop I,II and III and Electrical Science. |
|
Core |
Data Structures, Computer Organization, Algorithms, Database Management Systems, Operating Systems, Computer Networks, Theory of Computation, Principles of Programming Languages, Compilers and Software Engineering. |
|
Advanced |
Artificial Intelligence, Linear Programming, DataWarehousing and Data Mining, Web Data Knowledge Management, Information Extraction and Retrieval |
| Operating Systems |
Teaching Assistant in the monsoon semester of 2006 |
| Artificial Intelligence |
Teaching Assistant in the spring semester of 2006 |