• Contact
  • Career
  • Insights
Menu
  • Contact
  • Career
  • Insights
  • EN
  • DE
  • Collaboration
    • Agile Project Management
    • Document Management
    • Intranet Portals
    • Knowledge Management
    • Team Communication
  • Search
    • Cognitive Search
    • Contract and Legal Search​
    • E-Commerce​
    • E-Discovery and GDPR Search
    • Enterprise Search​
    • Expert and People Search
    • Location Search​
    • Merger and Acquisition Search
    • Payment Reconciliation​
    • Site Search
    • Warranty Search
  • Cloud
    • Chat Bots
    • Cloud Collaboration Architectures
    • Cloud-Native Applications
    • Data Analytics​
    • Hybrid Infrastructures​
    • Lift & Shift
    • Machine Learning​
  • Services
  • Products
    • Enterprise Search Connectors
    • Search & Retrieval Interface
    • Custom Security Manager
    • Search Center for Microsoft Search
  • About
    • Career (in German)
    • Our Customers
    • Partners & Technologies
    • Insights
    • Webinars
    • Contact
Menu
  • Collaboration
    • Agile Project Management
    • Document Management
    • Intranet Portals
    • Knowledge Management
    • Team Communication
  • Search
    • Cognitive Search
    • Contract and Legal Search​
    • E-Commerce​
    • E-Discovery and GDPR Search
    • Enterprise Search​
    • Expert and People Search
    • Location Search​
    • Merger and Acquisition Search
    • Payment Reconciliation​
    • Site Search
    • Warranty Search
  • Cloud
    • Chat Bots
    • Cloud Collaboration Architectures
    • Cloud-Native Applications
    • Data Analytics​
    • Hybrid Infrastructures​
    • Lift & Shift
    • Machine Learning​
  • Services
  • Products
    • Enterprise Search Connectors
    • Search & Retrieval Interface
    • Custom Security Manager
    • Search Center for Microsoft Search
  • About
    • Career (in German)
    • Our Customers
    • Partners & Technologies
    • Insights
    • Webinars
    • Contact
  • Collaboration
    • Agile Project Management
    • Document Management
    • Intranet Portals
    • Knowledge Management
    • Team Communication
  • Search
    • Cognitive Search
    • Contract and Legal Search​
    • E-Commerce​
    • E-Discovery and GDPR Search
    • Enterprise Search​
    • Expert and People Search
    • Location Search​
    • Merger and Acquisition Search
    • Payment Reconciliation​
    • Site Search
    • Warranty Search
  • Cloud
    • Chat Bots
    • Cloud Collaboration Architectures
    • Cloud-Native Applications
    • Data Analytics​
    • Hybrid Infrastructures​
    • Lift & Shift
    • Machine Learning​
  • Services
  • Products
    • Enterprise Search Connectors
    • Search & Retrieval Interface
    • Custom Security Manager
    • Search Center for Microsoft Search
  • About
    • Career (in German)
    • Our Customers
    • Partners & Technologies
    • Insights
    • Webinars
    • Contact
Menu
  • Collaboration
    • Agile Project Management
    • Document Management
    • Intranet Portals
    • Knowledge Management
    • Team Communication
  • Search
    • Cognitive Search
    • Contract and Legal Search​
    • E-Commerce​
    • E-Discovery and GDPR Search
    • Enterprise Search​
    • Expert and People Search
    • Location Search​
    • Merger and Acquisition Search
    • Payment Reconciliation​
    • Site Search
    • Warranty Search
  • Cloud
    • Chat Bots
    • Cloud Collaboration Architectures
    • Cloud-Native Applications
    • Data Analytics​
    • Hybrid Infrastructures​
    • Lift & Shift
    • Machine Learning​
  • Services
  • Products
    • Enterprise Search Connectors
    • Search & Retrieval Interface
    • Custom Security Manager
    • Search Center for Microsoft Search
  • About
    • Career (in German)
    • Our Customers
    • Partners & Technologies
    • Insights
    • Webinars
    • Contact
  • Contact
  • Career
  • Insights
  • Contact
  • Career
  • Insights
  • EN
  • DE
Blog Post

Enterprise Search with Apache Solr – A Beginner’s Guide

Apache Solr is a broadly used open-source search engine, which is a solid and great foundation for enterprise search and search-based applications. It is the technical backend of many commercial software packages and similar to Elasticsearch, it offers a great flexibility in configuring ranking and scoring models, linguistics as well as query and content processing. Due to this flexibility, it is also often used as a search engine for enterprise search. In this guide, we will briefly give an overview on how to get started with setting up Solr, integrating content sources and displaying results.

April 28, 2023
•
5 min read
Raytion
|
Raytion Insights
|
Enterprise Search with Apache Solr – A Beginner’s Guide

Apache Solr as a Foundation for an Enterprise Search

Many digital workplaces offer their embedded and built-in search engines. If these search experiences do not satisfy your needs and business requirements, there are good reasons to choose an external search engine.

Such an external search engine can be Apache Solr. Apache Solr is a scalable open-source search engine, which can run in failover setups and is capable of indexing millions of documents, in particular in enterprise search scenarios. It does not come with license costs. But for enterprise search, you need to build a few things around it, which commercial search engines usually offer out of the box.

An enterprise search based on Apache Solr comprises the following components:

  1. Displaying search results. A search interface, which displays the search results, offers verticals, filters, result blocks, a type-ahead and more. This is also where users need to authenticate against. Solr only offers a few security plugins and its documentation states, “No Solr API, including the Admin UI, is designed to be exposed to non-trusted parties”.
  2. Indexing data. Connectors are responsible for crawling documents and data from the content sources, which should be indexed.
  3. Security trimming and secure search. A security token store, such as the Raytion Custom Security Manager, is needed to implement secure search or security trimming, respectively.

We will now have a look at these components.

Apache Solr Architecture

Apache-Solr-Architecture-2
Apache Solr Architecture and Its Components

How Do You Index Content Sources?

It is quite easy to integrate third-party sources, such as Atlassian Confluence, ServiceNow or OpenText Content Server, in order to make Apache Solr a fully featured enterprise search. As Apache Solr does not come with built-in connectors, it offers all the APIs needed to index content sources, of course.

Thus, you can either write your own connectors and crawlers – an open source alternative is the framework Apache ManifoldCF for indexing many content sources – or you can use our Raytion Enterprise Search Connectors for indexing.

In order to integrate new content sources into Apache Solr, you need to set up and configure the following components:

  1. Within Apache Solr you need to create a collection with its schema.xml and solrconfig.xml.
  2. You need to have a security token store up and running, e.g., the Raytion Custom Security Manager or similar.
  3. You need to deploy and configure the connector, which indexes the content source.

Search Experience and User Journeys

Having the data of the content source indexed is the first step towards a great enterprise search. But this does not yet display the data to your users. Therefore, you need to have a search experience or a search interface at hand.

In the context of Apache Solr or Elasticsearch, the search interface has multiple responsibilities. In enterprise search scenarios it needs to make sure that only authenticated users can search. So it needs to support authentication providers, such as Azure AD, Google Cloud Identity, Okta, Active Directory or similar.

The search interface has to offer a query pipeline. Within this query pipeline the token store will be queried with the user ID of the searching user. Based on the response, the original search query is transformed, so that it adds an ACL filter (access control list filter), which is needed for secure search (i.e., security trimming). Furthermore, you can integrate synonym expansions, natural language understanding, ranking hints and more within the query pipeline.

After communicating with Apache Solr,  the search interface needs to render the search results, display filters and more. A common aim is that users can quickly distinguish between the search results and understand, which ones help them most in solving their task at hand.

We have customers, who either implement the search interface on their own or with our help. There are open-source frameworks available, which can be used as a starting point for your search interface implementation. Alternatively, our commercial framework Raytion Search & Retrieval Interface is a compatible turnkey solution for Apache Solr.

The Outcome

Apache Solr is a solid and open-source foundation for a great enterprise search experience. It offers full flexibility when it comes to customizing the query processing, content processing and search experience. But you need to build or purchase some components for this great search engine. This includes the search interface, connectors and a security token store. Our customers use the search engine in their enterprise search scenarios, for e-commerce or site searches together with their Adobe Experience Manager or Sitecore.

If you are interested in more information on how to build an enterprise search based on Apache Solr, then please reach out to our experts.

Share this article

Find us on social media

Twitter

Xing

LinkedIn

Other Insights

News
Event

Rays at the B2Run Event

On May 23rd, Raytion participated in another B2Run event, which is a running challenge for Düsseldorf companies. The B2Run is a company run concept to promote team building and social networking.

May 25, 2023
•
3 min read
News
Event

12th Raytion University Day at the Hyatt Regency Hotel

On 3rd May we had our semiannual University Day at Raytion. As part of our company culture, we took a day off at the Hyatt Regency Hotel to listen to talks and discuss our strategy, how we support our customers and where we add value.

May 9, 2023
•
4 min read
Blog Post

Enterprise Search with Azure Cognitive Search – A Beginner’s Guide

Azure Cognitive Search is a PaaS search engine, which serves as a solid and great foundation for search and search-based applications. It offers a high degree of flexibility in configuring ranking and scoring models, linguistics, as well as query and content processing. Due to this flexibility, it is often used as a search engine for enterprise search and here we will briefly explain how to get started.

April 5, 2023
•
7 min read
show all

Are you interested in Raytion's offers?

Please reach out to us

Services

We support the implementation of modern collaboration, search and cloud solutions. We provide and ensured the delivery of cutting-edge solutions to our customers and take care of the necessary strategic alignment. We offer broad consulting and integration services.
learn more

Products

At Raytion we unlock the potential of enterprise search by offering a wide range of in-house developed products and software components. They complement product offers of commercially available enterprise search engines and increase their value.
learn more

About Raytion

Founded in 2001, Raytion is an internationally operating IT-business consultancy that implements state-of-the-art information management, collaboration, search and cloud solutions.
learn more

Interested in a consultation?

Get in touch

Imprint

|

Privacy Policy

Products

Product Support Portal

Enterprise Search Connectors

Search & Retrieval Interface

Custom Security Manager

Search Center for Microsoft Search

Solutions

Collaboration

Search

Cloud

About

Customers

Career

Partners & Technologies

Insights

Webinars

Contact

© Copyright 2001 - 2023 Raytion GmbH, Düsseldorf

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using in our privacy policy.

Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.

3rd Party Cookies

This website uses Google Analytics to collect anonymous information such as the number of visitors to the site, and the most popular pages.

Keeping this cookie enabled helps us to improve our website.

Please enable Strictly Necessary Cookies first so that we can save your preferences!