The following is a bibliography from an ACM SIGIR 2002 tutorial given by Bharat, Broder, Hawking, and Raghavan.
- Abit97
-
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener.
The Lorel query language for semistructured data.
International Journal on Digital Libraries, 1(1):68-88, 1997.
http://www-db.stanford.edu/pub/papers.
- Albe99
-
R. Albert, H. Jeong, and A.-L. Barabasi.
Diameter of the world wide web.
Nature, 401:130-131, 1999.
- Amen00
-
B. Amento, L. Terveen, and W. Hill.
Does "authority" mean quality? Predicting expert quality ratings of
web documents.
In Proceedings of ACM SIGIR'00, pages 296-303, Athens, Greece,
2000.
- Amit98
-
E. Amitay.
Using common hypertext links to identify the best phrasal description
of target web documents.
In ACM SIGIR 98 Workshop on Hypertext IR for the Web,
Melbourne, 1998.
- Amit00
-
E. Amitay and C. Paris.
Automatically summarizing web sites - is there a way around it?
In ACM 9th International Conference on Information and Knowledge
Management (CIKM 2000), Washington, DC, 2000.
- Aroc97
-
G. O. Arocena, A. O. Mendelzon, and G. A. Mihaila.
Applications of a Web query language.
Computer Networks and ISDN Systems, 29(8-13):1305-1315, 1997.
http://www.cs.toronto.edu/~websql/www-conf/wsql/PAPER267.html.
- Babe97
-
Babel Team.
Web languages hit parade, June 1997.
http://babel.alis.com/palmares.html.
- Baez99
-
R. Baeza-Yates and B. Ribeiro-Neto.
Modern Information Retrieval.
Addison-Wesley, 1999.
- Bail01
-
P. Bailey, N. Craswell, and D. Hawking.
Engineering a multi-purpose test collection for web retrieval
experiments.
In press. http://www.ted.cmis.csiro.au/~dave/cwc.ps.gz, 2001.
- BarI02
-
J. Bar-Ilan.
Methods for measuring search engine performance over time.
JASIST, 53(4):308-319, 2002.
http://www.asis.org/Publications/JASIS/vol53n04.html.
- BarY00
-
Z. Bar-Yossef, A. Berg, S. Chien, J. Fakcharoenphol, and D. Weitz.
Approximating aggregate queries about Web pages via random walks.
In VLDB 2000, Proceedings of 26th International Conference on
Very Large Data Bases, September 10-14, 2000, Cairo, Egypt, pages 535-544.
Morgan Kaufmann Publishers, 2000.
http://www.vldb.org/dblp/db/conf/vldb/Bar-YossefBCFW00.html.
- Bara99
-
A. Barabasi and R. Albert.
Emergence of scaling in random networks.
Science, 286:509, 1999.
- Bhar00b
-
K. Bharat.
Searchpad: explicit capture of search context to support web search.
In Proceedings of WWW9, pages 493-501, 2000.
http://www9.org/w9cdrom/173/173.html.
- Bhar98a
-
K. Bharat and A. Broder.
A technique for measuring the relative size and overlap of public web
search engines.
In Proceedings of WWW7, pages 379-388, 1998.
http://www-sor.inria.fr/mirrors/www7/programme/fullpapers/1937/com1937.htm.
- Bhar98b
-
K. Bharat and A. Broder.
A technique for measuring the relative size and overlap of public web
search engines.
In Proceedings of WWW7, pages 369-477, 1998.
- Bhar99
-
K. Bharat and A. Broder.
Mirror, mirror on the web: A study of host pairs with replicated
content.
In Proceedings of WWW8, pages 501-512, Toronto, 1999.
http://www8.org/w8-papers/4c-server/mirror/mirror.html.
- Bhar00c
-
K. Bharat, A. Z. Broder, J. Dean, and M. R. Henzinger.
A comparison of techniques to find mirrored hosts on the WWW.
Journal of the American Society for Information Science,
51(12):1114-1122, 2000.
- Bhar01b
-
K. Bharat, B. Chang, M. Henzinger, and M. Ruhl.
Who links to whom: Mining linkage between web sites.
In Proceedings of IEEE ICDM-01, pages 51-58, 2001.
http://theory.lcs.mit.edu/~ruhl/papers/2001-icdm.html.
- Bhar98
-
K. Bharat and M. Henzinger.
Improved algorithms for topic distillation in a hyperlinked
environment.
In Proceedings of ACM SIGIR'98, pages 104-111, 1998.
- Bhar00
-
K. Bharat and G. Mihaila.
Hilltop: A search engine based on expert documents.
In Poster proceedings of WWW9, pages 72-73, 2000.
- Boro01
-
A. Borodin, G. Roberts, J. Rosenthal, and P. Tsaparas.
Finding authorities and hubs from link structures on the WWW.
In Proceedings of WWW10, May 2001.
http://www10.org/cdrom/papers/pdf/p314.pdf.
- Brin98
-
S. Brin and L. Page.
The anatomy of a large-scale hypertextual Web search engine.
In Proceedings of WWW7, pages 107-117, 1998.
http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm.
- Brod98
-
A. Broder.
On the resemblance and containment of documents.
In Compression and Complexity of Sequences (SEQUENCES'97),
pages 21-29. IEEE Computer Society, 1998.
ftp://ftp.digital.com/pub/DEC/SRC/publications/broder/positano-final-wpnums.pdf.
- Brod97
-
A. Broder, S. Glassman, M. Manasse, and G. Zweig.
Syntactic clustering of the web.
In Proceedings of WWW6, pages 391-404, 1997.
- Brod00
-
A. Broder, S. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata,
A. Tomkins, and J. Wiener.
Graph structure in the web: experiments and models.
In Proceedings of WWW9, Amsterdam, 2000.
http://www.www9.org/w9cdrom/160/160.html.
- Bruz00
-
P. Bruza, R. McArthur, and S. Dennis.
Interactive internet search: keyword, directory and query
reformulation mechanisms compared.
In Proceedings of ACM SIGIR'2000, Athens, Greece, 2000.
- Buck00
-
C. Buckley and E. Voorhees.
Evaluating evaluation measure stability.
In Proceedings of ACM SIGIR'00, pages 33-40, Athens, Greece,
2000.
- Call99
-
J. Callan, M. Connell, and A. Du.
Automatic discovery of language models for text databases.
In Proceedings of ACM SIGMOD'99, pages 479-490, New York,
1999.
- Call95
-
J. P. Callan, Z. Lu, and W. B. Croft.
Searching distributed collections with inference networks.
In Proceedings of ACM SIGIR'95, pages 12-20, Seattle, WA,
1995.
- Carr97
-
J. Carriere and R. Kazman.
Webquery: Searching and visualizing the web through connectivity.
In Proceedings of WWW6, 1997.
http://www.cgl.uwaterloo.ca/Projects/Vanish/webquery-1.html.
- Chak98b
-
S. Chakrabarti, B. Dom, D. Gibson, R. Kumar, P. Raghavan, A. Tomkins, and
S. Rajagopalan.
Spectral filtering for resource discovery.
In ACM SIGIR Workshop on Hypertext IR for the Web, Melbourne,
1998.
- Chak99c
-
S. Chakrabarti, B. Dom, and P. Indyk.
Enhanced hypertext classification using hyperlinks.
In Proceedings of ACM SIGMOD'98, 1998.
- Chak98
-
S. Chakrabarti, B. Dom, P. Raghavan, S. Rajagopalan, D. Gibson, and
J. Kleinberg.
Automatic resource compilation by analyzing hyperlink structure and
associated text.
In Proceedings of WWW7, pages 65-74, Brisbane, 1998.
http://www7.scu.edu.au/programme/fullpapers/1898/com1898.html.
- Chak01
-
S. Chakrabarti, M. Joshi, and V. Tawde.
Enhanced topic distillation using text, markup tags, and hyperlinks.
In Proceedings of ACM SIGIR'2001, pages 208-216, New Orleans,
USA, 2001.
- Chak99
-
S. Chakrabarti, M. van den Berg, and B. Dom.
Focused crawling: A new approach to topic-specific web resource
discovery.
In Proceedings of WWW8, Toronto, 1999.
- Chi01
-
E. H. Chi, P. Pirolli, K. Chen, and J. Pitkow.
Using information scent to model user information needs and actions
on the web.
In Proceedings of ACM CHI 2001, pages 490-497, Seattle, 2001.
- Cho00
-
J. Cho and H. Garcia-Molina.
The evolution of the web and implications for an incremental crawler.
In Proceedings of the Twenty-sixth International Conference on
Very Large Databases, 2000.
http://rose.cs.ucla.edu/~cho/papers/cho-evol.pdf.
- Cho98
-
J. Cho, H. Garcia-Molina, and L. Page.
Efficient crawling through URL ordering.
In Proceedings of WWW7, pages 161-172, Brisbane, 1998.
http://www7.scu.edu.au/programme/fullpapers/1919/com1919.htm.
- Cras99
-
N. Craswell, P. Bailey, and D. Hawking.
Is it fair to evaluate web systems using TREC ad hoc methods?
ACM SIGIR '99 Workshop on Web Retrieval, 1999.
http://pastime.anu.edu.au/nick/pubs/sigir99ws.ps.gz.
- Cras00
-
N. Craswell, P. Bailey, and D. Hawking.
Server selection on the world wide web.
In Proceedings of the ACM Digital Libraries Conference, San
Antonio, Texas, pages 37-46, June 2000.
- Cras01
-
N. Craswell, D. Hawking, and K. Griffiths.
Which search engine is best at finding airline site home pages?
Technical Report 01/45, CSIRO Mathematical and Information Sciences,
2001.
http://www.ted.cmis.csiro.au/~nickc/pubs/airlines.pdf.
- Cras01a
-
N. Craswell, D. Hawking, and S. Robertson.
Effective site finding using link anchor information.
In Proceedings of ACM SIGIR 2001, pages 250-257, New Orleans,
2001.
http://www.ted.cmis.csiro.au/nickc/pubs/sigir01.pdf.
- Cras99b
-
N. Craswell, D. Hawking, and P. Thistlewaite.
Merging results from isolated search engines.
In Proceedings of the 10th Australasian Database Conference,
Auckland, NZ, pages 189-200. Springer-Verlag, 1999.
http://www.ted.vic.cmis.csiro.au/~nickc/pubs/adc99.ps.gz.
- WEBT01
-
CSIRO.
TREC web tracks home page.
http://www.ted.cmis.csiro.au/TRECWeb/, 2001.
- Davi00
-
B. Davison.
Recognizing nepotistic links on the web.
In AAAI workshop on AI in web search, 2000.
http://archive.org/pub/aaai2000/BDavison2000.ps.
- Davi00b
-
B. Davison.
Topical locality in the web.
In Proceedings of ACM SIGIR'2000, pages 272-279, Athens,
Greece, 2000.
http://www.cs.rutgers.edu/~davison/pubs/2000/sigir/.
- Davi00a
-
B. Davison.
Topical locality in the web: Experiments and observations.
Technical report, Department of Computer Science, Rutgers University,
2000.
http://www.cs.rutgers.edu/pub/technical-reports/dcs-tr-414.ps.Z.
- Dean99
-
J. Dean and M. R. Henzinger.
Finding related pages in the World Wide Web.
Computer Networks (Amsterdam, Netherlands: 1999),
31(11-16):1467-1479, 1999.
http://research.compaq.com/SRC/WebArcheology/papers/companion.ps.
- Deer90
-
S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman.
Indexing by latent semantic analysis.
Journal of the American Society for Information Science,
41(6):391-407, 1990.
http://www.si.umich.edu/%7Efurnas/POSTSCRIPTS/LSI.JASIS.paper.ps.
- Drei97
-
D. Dreilinger and A. E. Howe.
Experiences with selecting search engines using metasearch.
ACM Transactions on Information Systems, 15(3):195-222, 1997.
- Eggh90
-
L. Egghe and R. Rousseau.
Introduction to Informetrics.
Elsevier, 1990.
- Ethn01
-
Ethnologue.
Ethnologue language name index.
http://www.ethnologue.com/language_index.asp.
- Fagi00
-
R. Fagin, A. Karlin, J. Kleinberg, P. Raghavan, S. Rajagopalan, R. Rubinfeld,
M. Sudan, and A. Tomkins.
Random walks with "back buttons" (extended abstract).
In Proceedings of STOC 2000, pages 484-493, 2000.
- Frak92
-
W. Frakes.
Stemming algorithms.
In W. B. Frakes and R. Baeza-Yates, editors, Information
Retrieval. Data Structures and Algorithms, pages 131-160. Prentice Hall,
Upper Saddle River NJ, 1992.
- Garf72
-
E. Garfield.
Citation analysis as a tool in journal evaluation.
Science, 178:471-479, 1972.
- Gauc96
-
S. Gauch and G. Wang.
Information fusion with ProFusion.
In Proceedings of WebNet '96: The First World Conference of
the Web Society, pages 174-179, October 1996.
Also at http://www.designlab.ukans.edu/ProFusion.html.
- Gilb97
-
N. Gilbert.
A simulation of the structure of academic science.
Sociological Research Online, 2(2), 1997.
- Gord99
-
M. Gordon and P. Pathak.
Finding information on the world wide web: The retrieval
effectiveness of search engines.
Information Processing and Management, 35(2):141-180, March
1999.
- Grav97
-
L. Gravano, K. Chang, H. Garcia-Molina, C. Lagoze, and A. Paepcke.
STARTS - Stanford protocol proposal for internet retrieval and
search.
http://www-db.stanford.edu/~gravano/starts.html,
January 1997.
- Harm92
-
D. Harman.
Evaluation issues in information retrieval.
Information Processing and Management, 28(4):439-440, 1992.
- Harm95
-
D. Harman.
The TREC conferences.
In R. Kuhlen and M. Rittberger, editors, Proceedings of HIM 95,
1995.
- Have02
-
T. H. Haveliwala.
Topic-sensitive pagerank.
In Proceedings of WWW2002, Honolulu, May 2002.
http://www2002.org/CDROM/refereed/127/.
- Hawk01
-
D. Hawking, N. Craswell, P. Bailey, and K. Griffiths.
Measuring search engine quality.
Information Retrieval, 4(1):33-59, 2001.
Pre-press version at
http://www.ted.cmis.csiro.au/~dave/INRT83-00.ps.gz.
- Hawk01a
-
D. Hawking, N. Craswell, and K. Griffiths.
Which search engine is best at finding online services?
In Poster Proceedings of WWW10, May 2001.
http://www.ted.cmis.csiro.au/~dave/www10poster.pdf.
- Hawk99a
-
D. Hawking, N. Craswell, P. Thistlewaite, and D. Harman.
Results and challenges in web search evaluation.
Proceedings of WWW8, 31:1321-1330, 1999.
http://www8.org/w8-papers/2c-search-discover/results/results.html.
- Hawk02
-
D. Hawking and S. Robertson.
On collection size and retrieval effectiveness.
Information Retrieval, To appear.
- Hawk99b
-
D. Hawking and P. Thistlewaite.
Methods for information server selection.
ACM Transactions on Information Systems, 17(1):40-76, 1999.
- Hawk99
-
D. Hawking, E. Voorhees, N. Craswell, and P. Bailey.
Overview of TREC-8 web track.
In Proceedings of TREC-8, pages 131-150, Gaithersburg MD,
November 1999.
http://trec.nist.gov/pubs/trec8/t8_proceedings.html.
- Hear95
-
M. Hearst.
Tilebars: Visualization of term distribution in full text information
access.
In Proceedings of ACM SIGCHI'95, pages 59-66, Denver, 1995.
- Henz99
-
M. R. Henzinger, A. Heydon, M. Mitzenmacher, and M. Najork.
Measuring index quality using random walks on the web.
In Proceedings of the Eighth International World-Wide Web
Conference, 1999.
http://www9.org/w9cdrom/88/88.html.
- Hube98
-
B. Huberman, P. Pirolli, J. Pitkow, and R. Lukose.
Strong regularities in world wide web surfing.
Science, 280:95-97, 1998.
- Hull96
-
D. Hull.
Stemming algorithms: A case study for detailed evaluation.
Journal of the American Society for Information Science,
47:70-84, 1996.
- Jans98
-
B. J. Jansen, A. Spink, J. Bateman, and T. Saracevic.
Real life information retrieval: A study of user queries on the
Web.
ACM SIGIR Forum, 32(1):5-17, 1998.
- Jarv00
-
K. Järvelin and J. Kekäläinen.
IR methods for retrieving highly relevant documents.
In Proceedings of ACM SIGIR'00, pages 41-48, Athens, Greece,
2000.
- Bhar01
-
K. Bharat and G. Mihaila.
When experts agree: Using non-affiliated experts to rank popular
topics.
In Proceedings of WWW10, Hong Kong, 2001.
(To appear in ACM TOIS.) http://www10.org/cdrom/papers/474/.
- Kess63
-
M. M. Kessler.
Bibliographic coupling between scientific papers.
American Documentation, 14, 1963.
- Kirs97
-
S. T. Kirsch.
Distributed search patent.
U.S. Patent 5,659,732, August 1997.
Infoseek Corporation.
http://software.infoseek.com/patents/dist_search/patents.htm.
- Kist98
-
T. Kistler and H. Marais.
WebL - a programming language for the web.
In Proceedings of WWW7, pages 259-270, Brisbane, 1998.
http://research.compaq.com/SRC/WebL/related/papers/www7/paper.html.
- Klei98
-
J. Kleinberg.
Authoritative sources in a hyperlinked environment.
In Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete
Algorithms, pages 668-677, 1998.
http://www.cs.cornell.edu/home/kleinber/auth.ps.
Also in Journal of the ACM, 46(5):604-632, 1999.
- Kuma00
-
S. Kumar, P. Raghavan, S. Rajagopalan, D. Sivakumar, A. Tomkins, and E. Upfal.
The web as a graph: Measurements, models and methods.
In Proceedings of ACM Symposium on Principles of Database
Systems, pages 1-10, 2000.
- Kuma99b
-
S. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins.
Extracting large-scale knowledge bases from the web.
In Proceedings of VLDB, pages 639-650, 1999.
- Kuma99
-
S. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins.
Trawling the web for emerging cyber communities.
In Proceedings of WWW8, Toronto, 1999.
- Lars96
-
R. Larson.
Bibliometrics of the world wide web: An exploratory analysis of the
intellectual structure of cyberspace.
In Annual Meeting of the American Society for Information Science,
1996.
- Lawr98b
-
S. Lawrence and C. Giles.
Inquirus, the NECI meta search engine.
In Proceedings of WWW7, pages 95-105, 1998.
- Lawr98
-
S. Lawrence and C. Giles.
Searching the world wide web.
Science, 280:98-100, Apr. 1998.
- Lawr99b
-
S. Lawrence and C. Giles.
Accessibility of information on the web.
Nature, 400:107-109, 8 July 1999.
- Lawr99
-
S. Lawrence, C. Giles, and K. Bollacker.
Digital libraries and autonomous citation indexing.
IEEE Computer, 32(6):67-71, 1999.
http://www.neci.nj.nec.com/homepages/lawrence/citeseer.html.
- LeCa2000
-
A. LeCalve and J. Savoy.
Database merging strategy based on logistic regression.
Information Processing and Management, 36(3):341-359, 2000.
- Lotk26
-
A. Lotka.
The frequency distribution of scientific productivity.
Journal of the Washington Academy of Sciences, 16:317, 1926.
- Marc97
-
M. Marchiori.
The quest for correct information on the web: Hyper search engines.
In Proceedings of WWW6, Santa Clara, 1997.
- Mcbr94
-
O. McBryan.
GENVL and WWWW: Tools for taming the web.
In Proceedings of WWW1, 1994.
- Mend97
-
A. Mendelzon, G. Mihaila, and T. Milo.
Querying the world wide web.
International Journal on Digital Libraries, 1(1):54-67, 1997.
- Mill56
-
G. Miller.
The magical number seven, plus or minus two: Some limits on our
capacity for processing information.
The Psychological Review, 63:81-97, 1956.
Reproduced at: http://www.well.com/user/smalin/miller.html.
- Netc02
-
Netcraft.
Netcraft web server survey.
http://www.netcraft.com/survey.
- TREC01
-
NIST.
TREC home page.
http://trec.nist.gov/, 2001.
- Note02
-
G. Notess.
Search engine showdown.
http://notess.com/.
- ONei97
-
E. T. O'Neill, P. D. McClain, and B. F. Lavoie.
A methodology for sampling the world wide web.
Technical report, OCLC Annual Review of Research, 1997.
http://www.oclc.org/research/publications/arr/1997/oneill/o0213.htm.
- Page98
-
L. Page, S. Brin, R. Motwani, and T. Winograd.
The PageRank citation ranking: Bringing order to the Web.
Technical report, Stanford Digital Library Technologies Project,
1998.
http://www-db.stanford.edu/~backrub/pageranksub.ps.
- Pa1897
-
V. Pareto.
Cours d'économie politique.
Rouge, Lausanne and Paris, 1897.
- Piro96
-
P. Pirolli, J. Pitkow, and R. Rao.
Silk from a sow's ear: Extracting usable structures from the web.
In Proceedings of CHI 96, pages 118-125, 1996.
- Pitk97
-
J. Pitkow.
Characterizing World Wide Web ecologies.
PhD thesis, Georgia Institute of Technology, June 1997.
- Port80
-
M. Porter.
An algorithm for suffix stripping.
Program, 14:130-137, 1980.
- Rade02
-
D. R. Radev, K. Libner, and W. Fan.
Getting answers to natural language questions on the web.
JASIST, 53(5), 2002.
http://www.asis.org/Publications/JASIS/vol53n05.html.
- Rand01
-
K. Randall, R. Stata, R. Wickremesinghe, and J. Wiener.
The link database: Fast access to graphs of the web.
Technical Report Research Report 175, Compaq, Systems Research
Center, Palo Alto, CA, 2001.
http://gatekeeper.research.compaq.com/pub/DEC/SRC/research-reports/abstracts/src-rr-175.html.
- Raso02
-
Y. Rasolofo, D. Hawking, and J. Savoy.
Result merging strategies for a current news metasearcher.
In submission, 2002.
- Rocc71
-
J. Rocchio.
Relevance feedback in information retrieval.
In G. Salton, editor, The SMART System: Experiments in Automatic
Document Processing, pages 313-323. Prentice Hall, Englewood Cliffs, NJ,
1971.
- Rusm01
-
P. Rusmevichientong, D. M. Pennock, S. Lawrence, and C. L. Giles.
Methods for sampling pages uniformly from the world wide web.
In AAAI Fall Symposium on Using Uncertainty Within
Computation, pages 121-128, 2001.
- Sala99
-
M. Salampasis and J. Tait.
A link-based collection fusion strategy.
Information Processing and Management, 35(5):691-711, 1999.
- Salt88
-
G. Salton and C. Buckley.
Term-weighting approaches in automatic text retrieval.
Information Processing and Management, 24:513-523, 1988.
- Salt90
-
G. Salton and C. Buckley.
Improving retrieval performance by relevance feedback.
Information Processing and Management, 26:73-92, 1990.
- Savo97
-
J. Savoy.
Statistical inference in retrieval effectiveness evaluation.
Information Processing and Management, 33(4):495-512, 1997.
- Schu97
-
H. Schütze and C. Silverstein.
Projections for efficient document clustering.
In Proceedings of ACM SIGIR'97, pages 74-81, Philadelphia,
1997.
- Selb95
-
E. Selberg and O. Etzioni.
Multi-service search and comparison using the meta-crawler.
In Proceedings of WWW4, Boston MA, 1995.
- Shak97
-
J. Shakes, M. Langheinrich, and O. Etzioni.
Dynamic reference sifting: A case study in the homepage domain.
In Proceedings of WWW6, pages 1193-1204, Santa Clara, 1997.
http://huskysearch.cs.washington.edu:6060/doc/presentation/.
- Shiv99b
-
N. Shivakumar and H. Garcia-Molina.
Finding near-replicas of documents on the web.
In WEBDB: International Workshop on the World Wide Web and
Databases, WebDB. LNCS, 1999.
http://www-db.stanford.edu/~shiva/Pubs/web.ps.
- Shiv99
-
N. Shivakumar, J. Cho, and H. Garcia-Molina.
Finding replicated web collections.
Technical report, Department of Computer Science, Stanford
University, 1999.
http://www-db.stanford.edu/pub/papers/cho_mirror.ps.
- Shiv95
-
N. Shivakumar and H. Garcia-Molina.
SCAM: A copy detection mechanism for digital documents.
In Proceedings of ACM DL'95, Austin TX, 1995.
- Silv98
-
C. Silverstein, M. Henzinger, H. Marais, and M. Moricz.
Analysis of a very large web search engine query log.
SIGIR Forum, 33(1):6-12, 1999.
Previously available as Digital Systems Research Center TR 1998-014
at http://www.research.digital.com/SRC.
- Sing01
-
A. Singhal and M. Kaszkiel.
A case study in web search using TREC algorithms.
In Proceedings of WWW10, pages 708-716, Hong Kong, 2001.
http://www.www10.org/cdrom/papers/pdf/p317.pdf.
- Smal73
-
H. Small.
Co-citation in the scientific literature: A new measure of the
relationship between two documents.
Journal of the American Society for Information Science, 24,
1973.
- Spar98
-
K. Sparck Jones, S. Walker, and S. Robertson.
A probabilistic model of information retrieval : Development and
status.
Technical Report TR 446, Cambridge University Computer Laboratory,
September 1998.
- Sper97
-
E. Spertus.
Parasite: Mining structural information on the web.
In Proceedings of WWW6, Santa Clara, 1997.
http://www6.nttlabs.com/HyperNews/get/PAPER206.html.
- Spin02
-
A. Spink, editor.
JASIST, volume 53, chapter Special Issue on Web Research.
Pergamon Press, 2002.
http://www.asis.org/Publications/JASIS/vol53n02.html.
- Spin02a
-
A. Spink.
A user-centered approach to evaluating human interaction with web
search engines: an exploratory study.
Information Processing and Management, 38(3):401-426, 2002.
- Spin00
-
A. Spink and J. Qin, editors.
Information Processing and Management, volume 36, chapter
Special Issue on Web-based Information Retrieval Research.
Pergamon Press, 2000.
- Stat00
-
R. Stata, K. Bharat, and F. Maghoul.
The term vector database: fast access to indexing terms for web
pages.
In Proceedings of WWW9, pages 247-255, Amsterdam, 2000.
http://www.www9.org/w9cdrom/159/159.html.
- Sten99
-
D. Stenmark.
Method for intranet search engine evaluations.
In Proceedings of IRIS22, Department of CS/IS, University of
Jyväskylä, Finland, August 1999.
http://w3.informatik.gu.se/~dixi/publ/method.pdf.
- Stev46
-
S. Stevens.
On the theory of scales of measurement.
Science, 103(2684):677-680, 1946.
- Trav01
-
R. Travis and A. Broder.
Web search quality vs. informational relevance.
In Proceedings of the 2001 Infonortics Search Engines Meeting,
Boston, 2001.
http://www.infonortics.com/searchengines/sh01/slides-01/travis.html.
- Turp01
-
A. Turpin and W. Hersh.
Why batch and user evaluations do not give the same results.
In Proceedings of ACM SIGIR'01, page ?, New Orleans, LA, 2001.
- Voor98b
-
E. Voorhees.
Using WordNet for text retrieval.
In C. Fellbaum, editor, WordNet: An Electronic Lexical
Database, pages 285-303. The MIT Press, Cambridge MA, 1998.
- Voor98
-
E. Voorhees.
Variations in relevance judgments and the measurement of retrieval
effectiveness.
In Proceedings of ACM SIGIR'98, pages 315-323, 1998.
http://www.itl.nist.gov/iaui/894.02/works/papers/sigir98.dvi.ps.
- Voor01
-
E. Voorhees.
Evaluation by highly relevant documents.
In Proceedings of ACM SIGIR'01, New Orleans, LA, 2001.
- Voor95
-
E. M. Voorhees, N. K. Gupta, and B. Johnson-Laird.
Learning collection fusion strategies.
In Proceedings of ACM SIGIR'95, pages 172-179, Seattle, WA,
1995.
- Whit89
-
H. White and K. McCain.
Bibliometrics.
In Annual Review of Information Science and Technology, pages 119-186.
Elsevier, 1989.
- W3C01
-
World Wide Web Consortium.
W3C internationalization/localization: Character sets supported by
popular web applications.
http://www.w3.org/International/O-charset-list.html.
- Yule44
-
G. Yule.
Statistical Study of Literary Vocabulary.
Cambridge University Press, 1944.
- Zami98
-
O. Zamir and O. Etzioni.
Web document clustering: A feasibility demonstration.
In Proceedings of ACM SIGIR'98, pages 46-54, Melbourne, 1998.
- Zami99
-
O. Zamir and O. Etzioni.
Grouper: A dynamic clustering interface to web search results.
In Proceedings of WWW8, pages 1361-1374, Toronto, 1999.
- Zipf49
-
G. Zipf.
Human Behavior and the Principle of Least Effort.
Addison-Wesley, 1949.