I want to train a agent to place a polyomino (only one for example a square of the measure of 2x2) on a 9x9 field, that is either empty or already contains multiple OTHER (not the 2x2 square one) polyominos. So the observation space would not be continuous. Is this a proper use case for RL?
Can I use Reinforcment Learning for a problem that has a non continous observation space?
50 Views Asked by FlorianFusseder At
1
There are 1 best solutions below
Related Questions in ALGORITHM
- I want to be able to use 4 different variables in a select statement in c ++
- llvm headers do not compile under msvc 2013
- VC++ .net: Functionality from managed DLL is not exported
- Add a picture to Picture Control in a dialog box (error RC2108: expected numerical dialog constant)
- Within a .vcxproj file what are the possible values for the <ConfigurationType> and what do those values mean?
- converting string to a double in visual c++ by parsing
- How to integrate opencv C++ codings with windows application?
- Create string with ESC characters
- What does the thing between "class" and the class name in VC++ mean?
- How to assign (Root)Folder ID in C++? Wherein, those files and folder under it would have the same ID as the RootFolder
Related Questions in MACHINE-LEARNING
- I want to be able to use 4 different variables in a select statement in c ++
- llvm headers do not compile under msvc 2013
- VC++ .net: Functionality from managed DLL is not exported
- Add a picture to Picture Control in a dialog box (error RC2108: expected numerical dialog constant)
- Within a .vcxproj file what are the possible values for the <ConfigurationType> and what do those values mean?
- converting string to a double in visual c++ by parsing
- How to integrate opencv C++ codings with windows application?
- Create string with ESC characters
- What does the thing between "class" and the class name in VC++ mean?
- How to assign (Root)Folder ID in C++? Wherein, those files and folder under it would have the same ID as the RootFolder
Related Questions in REINFORCEMENT-LEARNING
- I want to be able to use 4 different variables in a select statement in c ++
- llvm headers do not compile under msvc 2013
- VC++ .net: Functionality from managed DLL is not exported
- Add a picture to Picture Control in a dialog box (error RC2108: expected numerical dialog constant)
- Within a .vcxproj file what are the possible values for the <ConfigurationType> and what do those values mean?
- converting string to a double in visual c++ by parsing
- How to integrate opencv C++ codings with windows application?
- Create string with ESC characters
- What does the thing between "class" and the class name in VC++ mean?
- How to assign (Root)Folder ID in C++? Wherein, those files and folder under it would have the same ID as the RootFolder
Related Questions in MACHINE-LEARNING-MODEL
- I want to be able to use 4 different variables in a select statement in c ++
- llvm headers do not compile under msvc 2013
- VC++ .net: Functionality from managed DLL is not exported
- Add a picture to Picture Control in a dialog box (error RC2108: expected numerical dialog constant)
- Within a .vcxproj file what are the possible values for the <ConfigurationType> and what do those values mean?
- converting string to a double in visual c++ by parsing
- How to integrate opencv C++ codings with windows application?
- Create string with ESC characters
- What does the thing between "class" and the class name in VC++ mean?
- How to assign (Root)Folder ID in C++? Wherein, those files and folder under it would have the same ID as the RootFolder
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Sure, why not? The simplest versions of reinforcement learning algorithms use a discrete state space (and, indeed, assume for convergence that the agent is able to visit each state sufficiently many times). Even if there are too many states and you have to replace the Q function by a learned approximation (probably a neural net), you can use a one-hot encoding for the input.