Tech Blogs
Explore this specially curated section of Tech Blogs, where we delve into the intricate realms of Data Engineering and Data Science, sharing authoritative knowledge and cutting-edge expertise.
Data Engineering
Working on high nines where the latency of every query matters is a whole different beast in comparison with normal Cassandra use cases where measurements on 99.9% or even 99% are enough. We recently worked on a project which had really tight latencies (threshold in milliseconds on 99.999% of requests) and we needed latency measurement […]
|
Sep 20, 2020
Data Engineering
Browsing through logs is always hard, even when you are on a single node system. You are scrolling up and down, trying to figure out what events happened before a certain error. Often you want to see what followed after the error which happened, then you go back again to see the actual cause etc. […]
|
Sep 20, 2020
Data Engineering
Microservice architecture on the one hand, and distributed systems on the other, brought new challenges to the IT world. Here are two of them that are becoming increasingly noticeable: management of distributed systems with many nodes which involves deployment and orchestration and monitoring of those systems. Back in the days, you had a single machine […]
|
Sep 20, 2020
Data Engineering
Apache Cassandra is great for handling huge volumes of data. Everything works really great when you know your data patterns up front and you can make certain decisions based on that experience. We have recently worked for a marketing company which serves ads based on user behavior. The assumption of the use case and data […]
|
Sep 20, 2020
Data Engineering
Within several previous projects, we have held consultations for development teams coming from the relational world and adopting Cassandra database. The experience gained from those projects has triggered me to publish my learning guidelines for it. We see a pattern when adopting Cassandra and it involves taking a shortcut rather than an in-depth understanding of how Cassandra works. I will try to address those shortcuts and red flags and give you advice on how to start.
|
Aug 09, 2020
While Large Language Models (LLMs) like GPT, Gemini, or Claude are powerful, their large size and resource requirements make them impractical for many tasks. To address this, smaller open-source LLMs can be finetuned and customized for specific needs using techniques like Quantization and Low-Rank Adaptation (LoRA). These techniques reduce memory consumption and improve computational efficiency, […]
|
May 07, 2020
Data Engineering
Last October one of our co-founders and senior consultants Nenad Bozic held a presentation on Data Science conference 2.0 about challenges of monitoring distributed systems.
This is the abstract of the presentation.
|
Nov 22, 2016