万本电子书0元读

万本电子书0元读

顶部广告

Mastering Apache Solr 7.x电子书

售       价:¥

4人正在读 | 0人评论 9.8

作       者:Sandeep Nair,Chintan Mehta,Dharmesh Vasoya

出  版  社:Packt Publishing

出版时间:2018-02-22

字       数:35.1万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Accelerate your enterprise search engine and bring relevancy in your search analytics About This Book ? A practical guide in building expertise with Indexing, Faceting, Clustering and Pagination ? Master the management and administration of Enterprise Search Applications and services seamlessly ? Handle multiple data inputs such as JSON, xml, pdf, doc, xls,ppt, csv and much more. Who This Book Is For The book would rightly appeal to developers, software engineers, data engineers and database architects who are building or seeking to build enterprise-wide effective search engines for business intelligence. Prior experience of Apache Solr or Java programming is must to take the best of this book. What You Will Learn ? Design schema using schema API to access data in the database ? Advance querying and fine-tuning techniques for better performance ? Get to grips with indexing using Client API ? Set up a fault tolerant and highly available server with newer distributed capabilities, SolrCloud ? Explore Apache Tika to upload data with Solr Cell ? Understand different data operations that can be done while indexing ? Master advanced querying through Velocity Search UI, faceting and Query Re-ranking, pagination and spatial search ? Learn to use JavaScript, Python, SolrJ and Ruby for interacting with Solr In Detail Apache Solr is the only standalone enterprise search server with a REST-like application interface. providing highly scalable, distributed search and index replication for many of the world's largest internet sites. To begin with, you would be introduced to how you perform full text search, multiple filter search, perform dynamic clustering and so on helping you to brush up the basics of Apache Solr. You will also explore the new features and advanced options released in Apache Solr 7.x which will get you numerous performance aspects and making data investigation simpler, easier and powerful. You will learn to build complex queries, extensive filters and how are they compiled in your system to bring relevance in your search tools. You will learn to carry out Solr scoring, elements affecting the document score and how you can optimize or tune the score for the application at hand. You will learn to extract features of documents, writing complex queries in re-ranking the documents. You will also learn advanced options helping you to know what content is indexed and how the extracted content is indexed. Throughout the book, you would go through complex problems with solutions along with varied approaches to tackle your business needs. By the end of this book, you will gain advanced proficiency to build out-of-box smart search solutions for your enterprise demands. Style and approach An advance guide which will take you through complex problems with solutions along with varied approaches to tackle your business needs by using Apache solr 7.x
目录展开

Title Page

Copyright and Credits

Mastering Apache Solr 7.x

Packt Upsell

Why subscribe?

PacktPub.com

Contributors

About the authors

About the reviewers

Packt is searching for authors like you

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Introduction to Solr 7

Introduction to Solr

History of Solr

Lucene – the backbone of Solr

Why choose Solr?

Benefits of keyword search

Benefits of ranked results

Solr use cases

Social media

Science and research

Search engine

E-commerce

Media and entertainment

Government

Education

What's new in Solr 7?

Replication for SolrCloud

TLOG replicas

PULL replicas

Schemaless improvements

Autoscaling

Default numeric types

Spatial fields

SolrJ

JMX and MBeans

Other changes

Summary

Getting Started

Solr installation

Understanding various files and the folder structure

bin

Solr script

Post script

contrib

DataImportHandler

ContentExtractionLibrary

LanguageIdentifier

Clustering

VelocityIntegration

dist and docs

example

core.properties

zoo.cfg

solr.xml

server

Running Solr

Running basic Solr commands

Production Solr setup

Loading sample data

Loading data from MySQL

Understanding the browse interface

Using the Solr admin interface

Dashboard

Logging

Cloud screens

Tree view

Graph view

Collections or core admin

Java properties

Thread dump

Collection-specific tools

Overview

Analysis

DataImport

Documents

Files

Query

Stream

Schema

Core-specific tools

Summary

Designing Schemas

How Solr works

Getting started with Solr's basics

The schema file of Solr

Understanding field types

Definitions and properties of field types

Field type properties

Field types available in Solr

Understanding date fields

Understanding currencies and exchange rates

Understanding enum fields

Field management

Field properties

Copying fields

Dynamic fields

Mastering Schema API

Schema API in detail

Schema operations

Listing fields, field types, DynamicFields, and CopyField rules

Deciphering schemaless mode

Creating a schemaless example

Schemaless mode configuration

Managed schema

Field guessing

Summary

Mastering Text Analysis Methodologies

Understanding text analysis

What is text analysis?

How text analysis works

Understanding analyzer

What is an analyzer?

How an analyzer works

Understanding tokenizers

What is a tokenizer?

Available tokenizers in Solr

Standard tokenizer

White space tokenizer

Classic tokenizer

Keyword tokenizer

Lower case tokenizer

Letter tokenizer

N-gram tokenizer

Edge n-gram tokenizer

Understanding filters

What is a filter?

Available filters in Solr

Stop filter

Classic filter

Synonym filter

Synonym graph filter

ASCII folding filter

Keep word filter

KStem filter

KeywordMarkerFilterFactory

Word delimiter graph filter

Understanding CharFilter

Understanding PatternReplaceCharFilterFactor

Understanding multilingual analysis

Language identification

Configuring Solr for multiple language search

Creating separate fields per language

Creating separate indexes per language

Understanding phonetic matching

Understanding Beider-Morse phonetic matching

Summary

Data Indexing and Operations

Basics of Solr indexing

Installing Postman

Exploring the post tool

Understanding index handlers

Working with an index handler with the XML format

Index handler with JSON

Apache Tika and indexing

Solr Cell basics

Indexing a binary using Tika

Language detection

Language detection configuration

Client APIs

Summary

Advanced Queries – Part I

Search relevance

Velocity search UI

Query parsing and syntax

Common query parameters

Standard query parser

Advantage

Disadvantage

Searching terms for standard query parser

Term modifiers

Wildcard searches

Fuzzy searches

Proximity searching

Range searches

Boolean operators

Escaping special characters

Grouping terms

Dates and times in query strings

Adding comments to the query string

The DisMax Query Parser

Advantages

DisMax query parser parameters

eDisMax Query Parser

Response writer

JSON

Standard XML

CSV

Velocity

Faceting

Common parameters

Field-value faceting parameters

Range faceting

Pivot faceting

Interval faceting

Highlighting

Highlighting parameters

Highlighter

Unified highlighter (hl.method=unified)

Original highlighter (hl.method=original)

FastVector highlighter (hl.method=fastVector)

Boundary scanners

The breakIterator boundary scanner

The simple boundary scanner

Summary

Advanced Queries – Part II

Spellchecking

Spellcheck parameters

Implementation approaches

IndexBasedSpellChecker

DirectSolrSpellChecker

FileBasedSpellChecker

WordBreakSolrSpellChecker

Distributed spellcheck

Suggester

Suggester parameters

Running suggestions

Pagination

How to implement pagination

Cursor pagination

Result grouping

Result grouping parameters

Running result grouping

Result clustering

Result clustering parameters

Result clustering implementation

Install the clustering contrib

Declare the cluster search component

Declare the request handler and include the cluster search component

Spatial search

Spatial search implementation

Field types

Query parser

Spatial search query parser parameters

Function queries

Summary

Managing and Fine-Tuning Solr

JVM configuration

Managing the memory heap

Managing solrconfig.xml

User-defined properties

Implicit Solr core properties

Managing backups

Backup in SolrCloud

Standalone mode backups

Backup API

Backup status

API to restore

Restore status API

Snapshot API

JMX with Solr

JMX configuration

Logging configuration

Log settings using the admin web interface

Log level at startup

Setting the environment variable

Passing parameters in the startup script

Configuring Log4J for logging

SolrCloud overview

SolrCloud in interactive mode

SolrCloud – core concepts

Routing documents

Splitting shards

Setting up ignore commits from client applications

Enabling SSL – Solr security

Prerequisites

Generating a key and self-signed certificate

Starting Solr with SSL system properties

Performance statistics

Statistics for request handlers

Summary

Client APIs – An Overview

Client API overview

JavaScript Client API

SolrJ Client API

Ruby Client API

Python Client API

Summary

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部