I am using a Cassandra database with billions of records, and I want to store the data grouped by date. The problem is that users in different time zones need to query records by their local date, but the partition keys hold dates in GMT. Is it reasonable to store a separate date field for every time zone and create many views? That would grow the database to an enormous size.
Cassandra partition problems
108 Views Asked by Rostyslav At
There is 1 best solution below
I would suggest modifying your application layer to normalize the user's date into GMT, perform the search, and then convert the dates back into the user's time zone after retrieval.
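A minimal sketch of that normalization, assuming the table is partitioned by GMT/UTC calendar date and has a timestamp clustering column to filter on (both are assumptions about your schema). The helper names `utc_window_for_local_day` and `to_user_time` are hypothetical. One local day usually spans two UTC dates, so the query reads at most two partitions instead of needing a per-timezone copy of the data:

```python
from datetime import date, datetime, time, timedelta, timezone
from zoneinfo import ZoneInfo  # standard library since Python 3.9


def utc_window_for_local_day(local_day: date, tz_name: str):
    """Return (start_utc, end_utc, utc_days) for one calendar day in tz_name.

    start_utc is inclusive, end_utc is exclusive, and utc_days lists the
    UTC dates (assumed partition keys) that the window overlaps.
    """
    tz = ZoneInfo(tz_name)
    start_local = datetime.combine(local_day, time.min, tzinfo=tz)
    end_local = datetime.combine(local_day + timedelta(days=1), time.min, tzinfo=tz)
    start_utc = start_local.astimezone(timezone.utc)
    end_utc = end_local.astimezone(timezone.utc)

    # Collect every UTC date touched by the half-open window [start, end).
    utc_days = []
    day = start_utc.date()
    last_day = (end_utc - timedelta(microseconds=1)).date()
    while day <= last_day:
        utc_days.append(day)
        day += timedelta(days=1)
    return start_utc, end_utc, utc_days


def to_user_time(ts_utc: datetime, tz_name: str) -> datetime:
    """Convert a UTC timestamp read from the database to the user's zone."""
    return ts_utc.astimezone(ZoneInfo(tz_name))
```

The application would then query each date in `utc_days` with a range filter between `start_utc` and `end_utc` on the timestamp column, and pass every returned timestamp through `to_user_time` before display.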
But be careful with using dates as the partition key: this can create so-called "hot" partitions, where all write traffic hits the same partition and isn't evenly distributed between nodes.
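One common mitigation, which is not part of the suggestion above and is shown here only as an assumption-laden sketch, is to add a bucket number to the partition key so that a single day's writes spread over several partitions. The bucket count of 16, the `record_id` field, and both function names are hypothetical; a stable hash (`zlib.crc32`) is used because Python's built-in `hash()` for strings changes between processes:

```python
import zlib
from datetime import date

NUM_BUCKETS = 16  # hypothetical value; tune to your write volume and cluster size


def partition_key(day: date, record_id: str, num_buckets: int = NUM_BUCKETS):
    """Return the composite (day, bucket) key a record would be written under."""
    bucket = zlib.crc32(record_id.encode("utf-8")) % num_buckets
    return day, bucket


def partitions_for_day(day: date, num_buckets: int = NUM_BUCKETS):
    """Return every (day, bucket) key a reader must query to cover one day."""
    return [(day, bucket) for bucket in range(num_buckets)]
```

The trade-off is that reading one day now means querying all buckets for that day, so the bucket count should stay small enough for that fan-out to remain acceptable.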