|
Research paper on search technology:
Iterative
Filtering of Retrieved Information to Increase Relevance
|
Efforts have been underway for years to find more effective ways
to retrieve information from large knowledge domains. This effort
is now being driven particularly by the Internet and the vast amount
of information that is available to unsophisticated users. In the
early days of the Internet, some effort involved allowing users
to enter Boolean equations of search terms into search engines,
for example, rather than just a list of keywords. More recently,
effort has focused on understanding a users desires from past
search histories in order to narrow searches. Also there has been
much effort to improve the ranking of results based on some measure
of relevancy. This paper discusses using iterative filtering of
retrieved information to focus in on useful information. This work
was done for finding source code correlation and the author extends
his findings to Internet searching and e-commerce. The paper presents
specific information about a particular filtering application and
then generalizes it to other forms of information retrieval.
Paper given at the 11th World Multi-Conference on Systemics, Cybernetics
and Informatics, July 11, 2007.
|
|
Research paper on plagiarism detection:
Measuring Whitespace Patterns As An Indication Of Plagiarism
|
There are several different methods of comparing source code from
different programs to find copying. Perhaps the most common method
is comparing source code statements, comments, strings, identifiers,
and instruction sequences. However, there are anecdotes about the
use of whitespace patterns in code. These virtually invisible patterns
of spaces and tabs have been used in litigation to imply copying,
but no formal study has been performed that shows that these patterns
can actually identify copied code. This paper presents a detailed
study of whitespace patterns and the uniqueness of these patterns
in different programs.
Paper given at the ADFSL Conference on Digital Forensics, Security
and Law, May 20, 2010.
|