Mit dem DDB Backend 7.x wurden dedizierte Suchendpunkte in der DDB-API abgeschafft. Der neue Suchendpunkt erlaubt einen fast uneingeschränkten (lesenden) Zugriff auf den dahinterliegenden Solr Server. Dadurch wird eine flexiblere Nutzung des Solr Servers und eine Vereinfachung für den Nutzer erreicht, der sich nur noch in die Solr Syntax und das Solr Parameterset einarbeiten muss.
Auf den folgenden Seiten Im Folgenden sind die Solr Collections und deren Einsatzzweck dokumentiert. Es werden alle definierten Feld und die dynamischen Felder sowie das komplette Schema der jeweiligen Collection gelistet. Aus Sicherheitsgründen werden statt der kompletten Konfiguration einer Collection nur die für den User relevanten Teile (die Definitionen der zur Suche notwendigen RequestHandler) gezeigt.
Die Nutzung der Collections und Solr Features durch die im DDB-Kontext entwickelten Frontends wird ständig weiterentwickelt und ist nicht vollständig dokumentiert.
Alle Collections sind durch ein Schema definiert. Details über die zur Suche verwendete Syntax kann der offiziellen Solr Dokumentation entnommen werden
Collection: Search
Search ist der Hauptindex für die Suche nach Objekten in der DDB. Für jedes DDB-Objekt gibt es in dieser Collection genau ein Solr Dokument, identifiziert durch die DDB-ID.
...
Dieser Index wird ausschließlich im Zeitungsportal genutzt und enthält alle Daten aus dem zuletzt eingespielten ZDB Dump. Details zu den im Zeitungsportal genutzten Suchanfragen sind hier zu finden.
Code Block | ||||||
---|---|---|---|---|---|---|
| ||||||
{ "fields":[{ "name":"frequency", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"glossary.location", "type":"glossary", "multiValued":true, "indexed":true, "required":false, "stored":false}, { "name":"glossary.title", "type":"glossary", "multiValued":false, "indexed":true, "required":true, "stored":true}, { "name":"hasFulltext", "type":"boolean", "multiValued":false, "indexed":true, "required":true, "stored":true}, { "name":"hasLoadedIssues", "type":"boolean", "multiValued":false, "indexed":true, "required":true, "stored":true}, { "name":"id", "type":"string", "multiValued":false, "indexed":true, "required":true, "stored":true}, { "name":"isSupplementOf", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"language", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"location", "type":"text", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"locationUri", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"location_fct", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":false}, { "name":"printId", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"progress", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"sort.location", "type":"sort", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"sort.title", "type":"sort", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"subseries", "type":"text", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"supplements", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"thumbnail", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"title", "type":"text", "multiValued":true, "indexed":true, "required":true, "stored":true}]} |
...
Dieser Index enthält alles für das Zeitungsportal geladenen Daten. Die Besonderheit bei diesem Index ist die Kombination aus Ausgaben und Seiten. Jedes Solr Dokument in diesem Index repräsentiert entweder eine Ausgabe einer Zeitung oder eine Seite einer Ausgabe. Die Seiten sind als Child-Dokumente der Ausgaben indexiert. Dadurch wird die Suche innerhalb einer Ausgabe mit einer Seite als Ergebnis als auch die Suche im kompletten Bestand mit einer Ausgabe als Ergebnis ermöglicht. Details zu den im Zeitungsportal genutzten Anfragen sind hier zu finden.
Code Block | ||||||
---|---|---|---|---|---|---|
| ||||||
{ "fields":[{ "name":"_nest_parent_", "type":"idType", "indexed":true, "stored":true}, { "name":"_nest_path_", "type":"nest_path", "stored":true}, { "name":"_root_", "type":"idType", "indexed":true, "stored":false}, { "name":"_version_", "type":"long", "docValues":true, "multiValued":false, "indexed":false, "stored":false}, { "name":"dataset_id", "type":"string", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"id", "type":"idType", "multiValued":false, "indexed":true, "required":true, "stored":true}, { "name":"ingest_date", "type":"date", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"issue_id", "type":"idType", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"issuefulltext", "type":"text", "multiValued":false, "indexed":true, "stored":true}, { "name":"language", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"ns_disclaimer_required", "type":"boolean", "indexed":true, "required":false, "stored":true}, { "name":"pagefulltext", "type":"text_ocr", "multiValued":true, "indexed":true, "stored":true}, { "name":"pagename", "type":"string", "indexed":true, "stored":true}, { "name":"pagenumber", "type":"int", "multiValued":false, "indexed":false, "stored":true}, { "name":"paper_title", "type":"text", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"paper_title_fct", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"paper_title_fct_autocomplete", "type":"autocomplete", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"place_of_distribution", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"place_of_distribution_autocomplete", "type":"autocomplete", "multiValued":true, "indexed":true, "required":false, "stored":false}, { "name":"plainpagefulltext", "type":"text", "multiValued":false, "indexed":true, "stored":true}, { "name":"preview_reference", "type":"string", "multiValued":false, "indexed":false, "stored":true}, { "name":"provider", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"provider_autocomplete", "type":"autocomplete", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"provider_ddb_id", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"publication_date", "type":"date_range", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"publication_day_of_week", "type":"int", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"publication_year", "type":"int", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"revision_id", "type":"string", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"sort.publication_date", "type":"date", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"supplier_id", "type":"string", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"thumbnail", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"type", "type":"idType", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"zdb_id", "type":"idType", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"zdb_paper_title_fct", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":false}]} |
...
Code Block | ||||||
---|---|---|---|---|---|---|
| ||||||
{ "schema":{ "name":"example", "version":1.4, "uniqueKey":"id", "fieldTypes":[{ "name":"autocomplete", "class":"solr.TextField", "positionIncrementGap":"100", "indexAnalyzer":{ "charFilters":[{ "class":"solr.MappingCharFilterFactory", "mapping":"mapping_sortfield.txt"}], "tokenizer":{ "class":"solr.WhitespaceTokenizerFactory"}, "filters":[{ "class":"solr.LowerCaseFilterFactory"}, { "class":"solr.TrimFilterFactory"}, { "class":"solr.EdgeNGramFilterFactory", "maxGramSize":"15", "minGramSize":"2"}]}, "queryAnalyzer":{ "charFilters":[{ "class":"solr.MappingCharFilterFactory", "mapping":"mapping_sortfield.txt"}], "tokenizer":{ "class":"solr.WhitespaceTokenizerFactory"}, "filters":[{ "class":"solr.LowerCaseFilterFactory"}, { "class":"solr.TrimFilterFactory"}]}}, { "name":"boolean", "class":"solr.BoolField"}, { "name":"date", "class":"solr.DatePointField", "omitNorms":true, "sortMissingLast":true}, { "name":"date_range", "class":"solr.DateRangeField", "omitNorms":true, "omitTermFreqAndPositions":true, "termOffsets":false, "sortMissingLast":true, "termPositions":false, "omitPositions":true}, { "name":"ddbdate", "class":"solr.TextField", "omitNorms":true, "analyzer":{ "tokenizer":{ "class":"solr.StandardTokenizerFactory"}}}, { "name":"float", "class":"solr.TrieFloatField", "omitNorms":true, "positionIncrementGap":"0", "precisionStep":"8"}, { "name":"idType", "class":"solr.StrField", "omitNorms":true, "sortMissingLast":true}, { "name":"int", "class":"solr.IntPointField", "docValues":true}, { "name":"keyword", "class":"solr.StrField", "omitNorms":true, "sortMissingLast":true}, { "name":"long", "class":"solr.LongPointField", "docValues":true}, { "name":"nest_path", "class":"solr.NestPathField", "omitTermFreqAndPositions":true, "omitNorms":true, "maxCharsForDocValues":"-1", "stored":false, "multiValued":false}, { "name":"random", "class":"solr.RandomSortField", "indexed":true}, { "name":"string", "class":"solr.StrField", "omitNorms":true, "sortMissingLast":true}, { "name":"text", "class":"solr.TextField", "omitNorms":true, "sortMissingLast":true, "indexAnalyzer":{ "charFilters":[{ "class":"solr.MappingCharFilterFactory", "mapping":"mapping-ISOLatin1Accent.txt"}], "tokenizer":{ "class":"solr.WhitespaceTokenizerFactory"}, "filters":[{ "class":"solr.WordDelimiterGraphFilterFactory", "catenateNumbers":"1", "splitOnCaseChange":"0", "splitOnNumerics":"0", "catenateWords":"1"}, { "class":"solr.FlattenGraphFilterFactory"}, { "class":"solr.LengthFilterFactory", "min":"3", "max":"64"}, { "class":"solr.LowerCaseFilterFactory"}, { "class":"solr.SynonymFilterFactory", "synonyms":"lemma_replaced_umlaute_lower_case.txt"}, { "class":"solr.TrimFilterFactory"}]}, "queryAnalyzer":{ "charFilters":[{ "class":"solr.MappingCharFilterFactory", "mapping":"mapping-ISOLatin1Accent.txt"}], "tokenizer":{ "class":"solr.WhitespaceTokenizerFactory"}, "filters":[{ "class":"solr.WordDelimiterGraphFilterFactory", "catenateNumbers":"1", "splitOnCaseChange":"0", "splitOnNumerics":"0", "catenateWords":"1"}, { "class":"solr.LowerCaseFilterFactory"}, { "class":"solr.SynonymFilterFactory", "synonyms":"lemma_replaced_umlaute_lower_case.txt"}, { "class":"solr.TrimFilterFactory"}]}}, { "name":"text_ocr", "class":"solr.TextField", "termVectors":true, "storeOffsetsWithPositions":true, "indexAnalyzer":{ "charFilters":[{ "class":"solrocr.ExternalUtf8ContentFilterFactory"}, { "class":"solrocr.OcrCharFilterFactory"}, { "class":"solr.MappingCharFilterFactory", "mapping":"mapping-ISOLatin1Accent.txt"}], "tokenizer":{ "class":"solr.WhitespaceTokenizerFactory"}, "filters":[{ "class":"solr.WordDelimiterGraphFilterFactory", "catenateNumbers":"1", "splitOnCaseChange":"0", "splitOnNumerics":"0", "catenateWords":"1"}, { "class":"solr.FlattenGraphFilterFactory"}, { "class":"solr.LengthFilterFactory", "min":"3", "max":"64"}, { "class":"solr.LowerCaseFilterFactory"}, { "class":"solr.SynonymFilterFactory", "synonyms":"lemma_replaced_umlaute_lower_case.txt"}, { "class":"solr.TrimFilterFactory"}]}, "queryAnalyzer":{ "charFilters":[{ "class":"solr.MappingCharFilterFactory", "mapping":"mapping-ISOLatin1Accent.txt"}], "tokenizer":{ "class":"solr.WhitespaceTokenizerFactory"}, "filters":[{ "class":"solr.WordDelimiterGraphFilterFactory", "catenateNumbers":"1", "splitOnCaseChange":"0", "splitOnNumerics":"0", "catenateWords":"1"}, { "class":"solr.LowerCaseFilterFactory"}, { "class":"solr.SynonymFilterFactory", "synonyms":"lemma_replaced_umlaute_lower_case.txt"}, { "class":"solr.TrimFilterFactory"}]}}], "fields":[{ "name":"_nest_parent_", "type":"idType", "indexed":true, "stored":true}, { "name":"_nest_path_", "type":"nest_path", "stored":true}, { "name":"_root_", "type":"idType", "indexed":true, "stored":false}, { "name":"_version_", "type":"long", "docValues":true, "multiValued":false, "indexed":false, "stored":false}, { "name":"dataset_id", "type":"string", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"id", "type":"idType", "multiValued":false, "indexed":true, "required":true, "stored":true}, { "name":"ingest_date", "type":"date", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"issue_id", "type":"idType", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"issuefulltext", "type":"text", "multiValued":false, "indexed":true, "stored":true}, { "name":"language", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"ns_disclaimer_required", "type":"boolean", "indexed":true, "required":false, "stored":true}, { "name":"pagefulltext", "type":"text_ocr", "multiValued":true, "indexed":true, "stored":true}, { "name":"pagename", "type":"string", "indexed":true, "stored":true}, { "name":"pagenumber", "type":"int", "multiValued":false, "indexed":false, "stored":true}, { "name":"paper_title", "type":"text", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"paper_title_fct", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"paper_title_fct_autocomplete", "type":"autocomplete", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"place_of_distribution", "type":"string", "multiValued":true, "indexed":true, "required":false, "stored":true}, { "name":"place_of_distribution_autocomplete", "type":"autocomplete", "multiValued":true, "indexed":true, "required":false, "stored":false}, { "name":"plainpagefulltext", "type":"text", "multiValued":false, "indexed":true, "stored":true}, { "name":"preview_reference", "type":"string", "multiValued":false, "indexed":false, "stored":true}, { "name":"provider", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"provider_autocomplete", "type":"autocomplete", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"provider_ddb_id", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"publication_date", "type":"date_range", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"publication_day_of_week", "type":"int", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"publication_year", "type":"int", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"revision_id", "type":"string", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"sort.publication_date", "type":"date", "multiValued":false, "indexed":true, "required":false, "stored":false}, { "name":"supplier_id", "type":"string", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"thumbnail", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"type", "type":"idType", "multiValued":false, "indexed":true, "required":true, "stored":false}, { "name":"zdb_id", "type":"idType", "multiValued":false, "indexed":true, "required":false, "stored":true}, { "name":"zdb_paper_title_fct", "type":"string", "multiValued":false, "indexed":true, "required":false, "stored":false}], "dynamicFields":[{ "name":"random_*", "type":"random"}], "copyFields":[{ "source":"paper_title", "dest":"paper_title_fct"}, { "source":"paper_title", "dest":"paper_title_fct_autocomplete"}, { "source":"place_of_distribution", "dest":"place_of_distribution_autocomplete"}, { "source":"provider", "dest":"provider_autocomplete"}, { "source":"publication_date", "dest":"sort.publication_date"}]}} |
...