Sematext

2007
Brooklyn, NY, USA
Apr 22, 2019   |  By Radu Gheorghe
In the context of search, entity extraction is the process of figuring out which fields a query should target, as opposed to always hitting all fields. The reason to involve entity extraction in search is to improve precision. For example, when the user types Apple iPhone, how do we tell that the intent was to run company:Apple AND product:iPhone, and not to bring back phone stickers in the shape of an apple?
Apr 16, 2019   |  By Sematext
When managing cloud-native applications, it’s essential to have end-to-end visibility into what’s happening at any given time. This is especially true because of the distributed and dynamic nature of cloud-native apps, which are often deployed using ephemeral technologies like containers and serverless functions.
Apr 12, 2019   |  By Radu Gheorghe
In the context of search, entity extraction is the process of figuring out which fields a query should target, as opposed to always hitting all fields. The reason to involve entity extraction in search is to improve precision. For example, when the user types Apple iPhone, how do we tell that the intent was to run company:Apple AND product:iPhone, and not to bring back phone stickers in the shape of an apple?
Apr 8, 2019   |  By Rafal Kuć
Monitoring Kafka is a tricky task. As you can see in the first chapter, Kafka Key Metrics to Monitor, the setup, tuning, and operations of Kafka require deep insights into performance metrics such as consumer lag, I/O utilization, garbage collection and many more. Sematext provides an excellent alternative to other Kafka monitoring tools because it’s quick and simple to use.
Apr 8, 2019   |  By Sematext
As the first part of a three-part series on Apache Kafka monitoring, this article explores which Kafka metrics are important to monitor and why. When monitoring Kafka, it’s important to also monitor ZooKeeper as Kafka depends on it. The second part will cover Kafka open source monitoring tools, and identify the tools and techniques you need to further help monitor and administer Kafka in production.
Nov 7, 2018   |  By Sematext
If you manage Elasticsearch clusters you’ll find this cheat sheet created by Sematext Elasticsearch experts very handy. In it you will find a comprehensive list of copy-paste curl snippets for allocation, caches, segment merges, performance troubleshooting and quite a bit more. Enjoy and share!
Nov 7, 2018   |  By Sematext
The Cloud Native movement and migration of applications to microservice architectures require general visibility and observability into software behavior. OpenTracing aims to offer a consistent, unified, and tracer-agnostic instrumentation API for a wide range of frameworks, platforms and programming languages.
Nov 1, 2018   |  By Sematext
This Elasticsearch Developer Cheat Sheet provides a comprehensive list of key Elasticsearch operations every developer needs – index creation, deletion, mapping manipulation, indexing API, ingestion API, querying, aggregations, document relations (nested and parent child) and more! Enjoy and share!
Nov 1, 2018   |  By Sematext
In this reference architecture document, you will find out about all key Docker metrics to watch. Following that, you will learn how to set up monitoring and logging for a Docker UCP cluster. Specifically, this e-book shows how to use Sematext Docker Agent to collect metrics, events and logs for all Docker hosts and containers. Enjoy and share!
Oct 1, 2018   |  By Sematext
This Solr / SolrCloud Metrics API Cheat Sheet shows you how to access all the new Solr metrics – Jetty Metrics, JVM Metrics, Solr Node Metrics, Core OS metrics, etc. Print it. Copy-paste from it. Use it when troubleshooting Solr performance issues. Enjoy and share!
Apr 22, 2019   |  By Sematext
A user looking for “awesome smartphone 2018” is likely really after “+review:awesome +category:smartphone +release_date:2018”. A clever use of (e)dismax might get us pretty close to what we want, but it’s not real query understanding. There are other ways, of course, such as training a model that guesses, based on the keyword, which field it should be matched against.
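To make the rewrite above concrete, here is a minimal sketch of dictionary-based field guessing. It is not the approach from the post itself: the vocabulary table, the year pattern, and the `rewrite_query` helper are all hypothetical, and a real system would more likely derive the mapping from the index or from a trained model.

```python
import re

# Hypothetical lookup table mapping field names to known terms.
# In practice this could come from index statistics or a trained model.
FIELD_VOCAB = {
    "review": {"awesome", "great", "terrible"},
    "category": {"smartphone", "laptop", "tablet"},
}

# Assumption: any standalone four-digit year refers to release_date.
YEAR_RE = re.compile(r"^(19|20)\d{2}$")


def rewrite_query(query: str) -> str:
    """Turn free-text keywords into fielded, required clauses where possible."""
    clauses = []
    for token in query.lower().split():
        if YEAR_RE.match(token):
            clauses.append(f"+release_date:{token}")
            continue
        # Pick the first field whose vocabulary contains the token, if any.
        field = next(
            (name for name, words in FIELD_VOCAB.items() if token in words),
            None,
        )
        # Unknown tokens are left as plain keywords for the default fields.
        clauses.append(f"+{field}:{token}" if field else token)
    return " ".join(clauses)


print(rewrite_query("awesome smartphone 2018"))
# prints: +review:awesome +category:smartphone +release_date:2018
```

Tokens that match no known field stay unfielded, so the query parser can still run them against the default fields.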
Dec 10, 2018   |  By Sematext
DockerCon, The #1 Container Industry Conference. Dec 3-5, 2018, Barcelona
Nov 15, 2018   |  By Sematext
A video explaining how to use Logsene to troubleshoot with logs.
Oct 17, 2018   |  By Sematext
Your software stack likely consists of web servers, search engines, queues, databases, etc. Each part of your stack emits its own metrics and logs. Depending on the size of your team and structure, different team members might have permissions to look at one set of data, but not the other. Some data is needed for troubleshooting and can be discarded after just a few days, while more important data might need to be kept for months for legal or capacity planning purposes.
Oct 10, 2018   |  By Sematext
Your software stack likely consists of web servers, search engines, queues, databases, etc. Each part of your stack emits its own metrics and logs. Depending on the size of your team and structure, different team members might have permissions to look at one set of data, but not the other. Some data is needed for troubleshooting and can be discarded after just a few days, while more important data might need to be kept for months for legal or capacity planning purposes.