background-animation-light-1 background-animation-light-3 background-animation-light-2

Tech Blogs

Insights from Our Experts

Explore this specially curated section of Tech Blogs, where we delve into the intricate realms of Data Engineering and Data Science, sharing authoritative knowledge and cutting-edge expertise.

25 Results

Data Engineering

Slow Queries Monitoring

Working on high nines where the latency of every query matters is a whole different beast in comparison with normal Cassandra use cases where measurements on 99.9% or even 99% are enough. We recently worked on a project which had really tight latencies (threshold in milliseconds on 99.999% of requests) and we needed latency measurement […]

Nenad Božić

|

Sep 20, 2020

Data Engineering

Distributed logging

Browsing through logs is always hard, even when you are on a single node system. You are scrolling up and down, trying to figure out what events happened before a certain error. Often you want to see what followed after the error which happened, then you go back again to see the actual cause etc. […]

Nenad Božić

|

Sep 20, 2020

Data Engineering

Monitoring stack for distributed systems

Microservice architecture on the one hand, and distributed systems on the other, brought new challenges to the IT world. Here are two of them that are becoming increasingly noticeable: management of distributed systems with many nodes which involves deployment and orchestration and monitoring of those systems. Back in the days, you had a single machine […]

Nenad Božić

|

Sep 20, 2020

Data Engineering

Where is my data – debugging SSTables in Cassandra

Apache Cassandra is great for handling huge volumes of data. Everything works really great when you know your data patterns up front and you can make certain decisions based on that experience. We have recently worked for a marketing company which serves ads based on user behavior. The assumption of the use case and data […]

Nenad Božić

|

Sep 20, 2020

Data Engineering

How (not) to start with Apache Cassandra

Within several previous projects, we have held consultations for development teams coming from the relational world and adopting Cassandra database. The experience gained from those projects has triggered me to publish my learning guidelines for it. We see a pattern when adopting Cassandra and it involves taking a shortcut rather than an in-depth understanding of how Cassandra works. I will try to address those shortcuts and red flags and give you advice on how to start.

Nenad Božić

|

Aug 09, 2020

Efficient Finetuning of Large Language Models (LLMs)

While Large Language Models (LLMs) like GPT, Gemini, or Claude are powerful, their large size and resource requirements make them impractical for many tasks. To address this, smaller open-source LLMs can be finetuned and customized for specific needs using techniques like Quantization and Low-Rank Adaptation (LoRA). These techniques reduce memory consumption and improve computational efficiency, […]

Miloš Živić

|

May 07, 2020

Data Engineering

Challenges of Monitoring Distributed Systems

Last October one of our co-founders and senior consultants Nenad Bozic held a presentation on Data Science conference 2.0 about challenges of monitoring distributed systems.

This is the abstract of the presentation.

Nenad Božić

|

Nov 22, 2016

1 2 3
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.