Right now, around 100 gigabytes of data is generated from user activity across our company’s different applications and products. The product I am working on consumes all of it, gives insights into user behavior, and also helps to …
Cassandra
Running PySpark with Cassandra using spark-cassandra-connector in Jupyter Notebook
We were facing several out-of-memory issues when running operations on the large data sets stored in our Cassandra cluster, so we decided it would be better to use Spark to solve the problem. It became a tough & …
Adding Alerts for Cassandra nodetool status and Kafka lag with Telegraf, InfluxDB, and Grafana
Maybe my requirement is a little different from others’, which is why I didn’t get much help from the available tools and procedures. I added two more alerts using a hack similar to the one I explained in my previous article. The application which I …