Searching for domain-specific information on the web is,
We have different search methods over community documents but they are,
The research is in the context of the WikiDisability Project, which aims to make disability-specific documents more accessible to NGOs, stakeholders and people.
The documents involved are either web-based blogs or electronic documents, represented as free text (in PDF) and as structured data (in RDF, on a Wikibase instance).
We wished to compare different search methods to find the one that gives the stakeholders involved the best user experience.
We chose the following search methods to compare,
The wikibase dataset and the documents were uploaded for every user.
There were 24 documents and 17 candidates for the experiment.
Two different questionnaires were provided to the user,
ESDoc provided the most answers and the most relevant ones, followed by QAnswer KG and QADoc.
ESDoc also gave a false sense of information for instructions that had no answer.
Scores obtained from the UEQ on its different scales:
Scales | QAnswer KG | ESDoc | QADoc |
---|---|---|---|
Attractive | -0.272 | -0.114 | -0.433 |
Perspicuity | -0.014 | -1.205 | -0.05 |
Efficiency | -0.22 | 0.014 | -0.583 |
Dependability | -0.132 | -0.014 | -0.266 |
Stimulation | -0.161 | 0.0588 | -0.1 |
Novelty | -0.088 | -0.191 | 0.266 |
The scores obtained in the experiment fall into the category "bad" for all scales in the UEQ benchmark.
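As a rough illustration of how a scale score like those in the table is usually derived, the sketch below averages questionnaire answers per scale. It assumes answers on a 1 to 7 scale that are already polarity-corrected and shifts them to the range -3 to +3; the function name `scale_score` and the example answers are hypothetical and are not the data from this experiment.

```python
from statistics import mean


def scale_score(responses):
    """Mean score of one questionnaire scale.

    `responses` holds one list per participant with that participant's
    answers (1..7, assumed already polarity-corrected) to the items
    belonging to the scale. Answers are shifted to -3..+3 first.
    """
    per_participant = [mean(answer - 4 for answer in items) for items in responses]
    return mean(per_participant)


# Hypothetical answers of three participants to a four-item scale.
example = [[4, 3, 5, 4], [2, 4, 4, 3], [5, 4, 3, 4]]
print(round(scale_score(example), 3))
```

A negative result means the participants leaned towards the negative end of the scale on average, which is the pattern seen in most cells of the table.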
One-way ANOVA test
Scale | F-Ratio | P-Value |
---|---|---|
Attractive | 1.269 | 0.29 |
Perspicuity | 36.20 | < 0.001 |
Efficiency | 5.284 | 0.008 |
Dependability | 0.861 | 0.429 |
Stimulation | 1.78 | 0.179 |
Novelty | 3.2 | 0.049 |
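For readers unfamiliar with the F-ratio column, here is a minimal sketch of how a one-way ANOVA F-ratio is computed from per-method score lists. The function name `one_way_anova_f` and the example numbers are hypothetical; the per-participant scores behind the table are not reproduced here.

```python
def one_way_anova_f(groups):
    """Return the F-ratio of a one-way ANOVA over lists of scores."""
    all_scores = [x for g in groups for x in g]
    k, n = len(groups), len(all_scores)
    grand_mean = sum(all_scores) / n
    group_means = [sum(g) / len(g) for g in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Variation of individual scores around their own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ms_between / ms_within


# Hypothetical scores for three search methods.
print(one_way_anova_f([[1, 2, 3], [2, 3, 4], [3, 4, 5]]))
```

A large F-ratio means the differences between the method means are large compared with the spread inside each method, which is what the small p-value for Perspicuity reflects.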
qtukey values from the Tukey-Kramer test
The critical value is 3.425.
Scale | Groups | qtukey |
---|---|---|
Perspicuity | QAnswerKG vs ESDoc | 10.6742 |
Perspicuity | ESDoc vs QADoc | 10.029 |
Perspicuity | QAnswerKG vs QADoc | 0.306 |
Efficiency | QAnswerKG vs ESDoc | 1.86 |
Efficiency | ESDoc vs QADoc | 4.579 |
Efficiency | QAnswerKG vs QADoc | 2.777 |
Novelty | QAnswerKG vs ESDoc | 0.798 |
Novelty | ESDoc vs QADoc | 3.439 |
Novelty | QAnswerKG vs QADoc | 2.665 |
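The comparison of each qtukey value against the critical value can be sketched as follows. The numbers are copied from the table above; the names `Q_VALUES` and `significant_pairs` are only illustrative.

```python
CRITICAL_VALUE = 3.425

# (scale, pair of methods) -> qtukey value, copied from the table.
Q_VALUES = {
    ("Perspicuity", "QAnswerKG vs ESDoc"): 10.6742,
    ("Perspicuity", "ESDoc vs QADoc"): 10.029,
    ("Perspicuity", "QAnswerKG vs QADoc"): 0.306,
    ("Efficiency", "QAnswerKG vs ESDoc"): 1.86,
    ("Efficiency", "ESDoc vs QADoc"): 4.579,
    ("Efficiency", "QAnswerKG vs QADoc"): 2.777,
    ("Novelty", "QAnswerKG vs ESDoc"): 0.798,
    ("Novelty", "ESDoc vs QADoc"): 3.439,
    ("Novelty", "QAnswerKG vs QADoc"): 2.665,
}


def significant_pairs(q_values, critical):
    """Return the (scale, pair) keys whose q value exceeds the critical value."""
    return [key for key, q in q_values.items() if q > critical]


for scale, pair in significant_pairs(Q_VALUES, CRITICAL_VALUE):
    print(f"{scale}: {pair}")
```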
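We see that there is a significant difference (qtukey above the critical value) between QAnswerKG and ESDoc on Perspicuity, and between ESDoc and QADoc on Perspicuity, Efficiency and Novelty.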
We presented a user-experience focused evaluation of search methods on domain-specific documents.
Elasticsearch over documents (ESDoc) provided relevant answers, but also gave a false sense of relevancy.
For non-exploratory question answering with an exact answer, we need more than ESDoc.
QADoc was perceived to be innovative, but didn't perform well for information retrieval.
We believe that there is a need to combine various search methods for different types of questions.
We therefore developed a demo that combines the different search methods and falls back from one to the next.
We used Wikibase to store data about the document, and QADoc for the data inside the document.
If neither gives a confident answer, we run an Elasticsearch query and highlight the keywords in the results.
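The fallback idea can be sketched roughly as below. This is not the demo's code: the backends are passed in as plain callables, and the order (knowledge graph first, then QADoc), the confidence threshold of 0.5, the naive keyword split and the `**` highlight marker are all assumptions made for illustration.

```python
import re
from typing import Callable, List, Optional, Tuple

Answer = Tuple[str, float]  # (answer text, confidence between 0 and 1)
QABackend = Callable[[str], Optional[Answer]]
KeywordBackend = Callable[[str], List[str]]


def highlight(text: str, keywords: List[str], mark: str = "**") -> str:
    """Wrap every case-insensitive keyword occurrence in `mark`."""
    keywords = [k for k in keywords if k]
    if not keywords:
        return text
    pattern = re.compile("|".join(re.escape(k) for k in keywords), re.IGNORECASE)
    return pattern.sub(lambda m: f"{mark}{m.group(0)}{mark}", text)


def combined_search(
    question: str,
    kg_search: QABackend,            # hypothetical wrapper around the Wikibase QA
    doc_search: QABackend,           # hypothetical wrapper around QADoc
    keyword_search: KeywordBackend,  # hypothetical wrapper around Elasticsearch
    threshold: float = 0.5,          # assumed confidence cut-off
) -> Tuple[str, List[str]]:
    """Try the QA backends in order, then fall back to keyword search."""
    for name, backend in (("kg", kg_search), ("qadoc", doc_search)):
        result = backend(question)
        if result is not None and result[1] >= threshold:
            return name, [result[0]]
    # No confident answer: return keyword hits with the query terms highlighted.
    keywords = question.split()
    return "keyword", [highlight(hit, keywords) for hit in keyword_search(question)]


# Stub backends standing in for the real services.
source, answers = combined_search(
    "wheelchair access",
    kg_search=lambda q: None,
    doc_search=lambda q: ("uncertain", 0.2),
    keyword_search=lambda q: ["Wheelchair access rules"],
)
print(source, answers)
```

Passing the backends as callables keeps the sketch independent of any particular client library, so each one can be swapped for a real service call.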
We further plan to repeat the experiment on a new set of documents with the resulting combined search demo, to evaluate the differences.
Thank you for your time, questions?