According to the Apache website TDB 'can be used as a high performance RDF store on a single machine'. Reading the documentation I don't see where it stores anything. Is it simply storing each resource in its own file within a defined directory as outlined in this tutorial? If so that seems as if it will not scale very well.
How does Apache TDB store RDF data?
1.3k Views Asked by WildBill At
2
There are 2 best solutions below
1

The documentation you linked includes a TDB Design link.
This page covers the technical details of what data structures are used internally and how they are stored on disk
Related Questions in JAVA
- JMH Benchmark on Hadoop YARN
- Using SIMD on amd64, when is it better to use more instructions vs. loading from memory?
- Strange behavior in sun.misc.Unsafe.compareAndSwap measurement via JMH
- Multiple 500 errors when google caliper tries to upload benchmark results to microbenchmarks.appspot.com
- How to run a criterion benchmark for a monadic function
- Why is the first run of microbenchmark always the slowest?
- Cache doesn't seem to be effective
- Why is strconv.ParseUint so slow compared to strconv.Atoi?
- How can I find the missing value more concisely?
- Cache hits, misses and prediction - impact on performance
Related Questions in APACHE
- JMH Benchmark on Hadoop YARN
- Using SIMD on amd64, when is it better to use more instructions vs. loading from memory?
- Strange behavior in sun.misc.Unsafe.compareAndSwap measurement via JMH
- Multiple 500 errors when google caliper tries to upload benchmark results to microbenchmarks.appspot.com
- How to run a criterion benchmark for a monadic function
- Why is the first run of microbenchmark always the slowest?
- Cache doesn't seem to be effective
- Why is strconv.ParseUint so slow compared to strconv.Atoi?
- How can I find the missing value more concisely?
- Cache hits, misses and prediction - impact on performance
Related Questions in RDF
- JMH Benchmark on Hadoop YARN
- Using SIMD on amd64, when is it better to use more instructions vs. loading from memory?
- Strange behavior in sun.misc.Unsafe.compareAndSwap measurement via JMH
- Multiple 500 errors when google caliper tries to upload benchmark results to microbenchmarks.appspot.com
- How to run a criterion benchmark for a monadic function
- Why is the first run of microbenchmark always the slowest?
- Cache doesn't seem to be effective
- Why is strconv.ParseUint so slow compared to strconv.Atoi?
- How can I find the missing value more concisely?
- Cache hits, misses and prediction - impact on performance
Related Questions in TDB
- JMH Benchmark on Hadoop YARN
- Using SIMD on amd64, when is it better to use more instructions vs. loading from memory?
- Strange behavior in sun.misc.Unsafe.compareAndSwap measurement via JMH
- Multiple 500 errors when google caliper tries to upload benchmark results to microbenchmarks.appspot.com
- How to run a criterion benchmark for a monadic function
- Why is the first run of microbenchmark always the slowest?
- Cache doesn't seem to be effective
- Why is strconv.ParseUint so slow compared to strconv.Atoi?
- How can I find the missing value more concisely?
- Cache hits, misses and prediction - impact on performance
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
TDB does not only store the file in a specified folder. The content of the file will be indexed. There are several indexes built for one file: One index is built for S P O order, another for e.g., P O S and so on (as I said for each combination).
Those indexes are stored in the specified folder. Depending on the queries, the appropriate indexes will be loaded.
If you add a RDF file to a TDB store, you will see that many files are created. Although this means that the actual content of the file will be stored multiple times (for each index), it will speed up query execution which is often more preferred that minimal storage usage.