I am working on a deep learning model for "Age Invariant Face detection". I started with preprocessing the data which included face cropping via MTCNN, image contrast correction and sharpening. I then moved to build the model architecture, I designed a custom sequential CNN model. The training and validation losses were close to 1.9 but the model was taking approx 8 hrs for one epoch. I needed fast results so I moved to try on transfer learning. I read a paper which said Resnet-18 performed well on the task so I went with Resnet-18 first. Tried on tuning it by varying learning rates, but for no good. Accuracy wasn't increasing beyond 40%. I then went with Resnet-50, currently, I am working on it. I tried fine-tuning by changing the learning rate manually, tried with different optimizers Adam, SGD. Used learning rate decay. My model started to overfit so then I added dropout, image augmentation to training data and again added weight_decay as it was still overfitting. Right now there is no overfit but the losses aren't decreasing. I am trying on doing my experiments over it, reading many articles per day but feels like I'm stuck.
Fine tuning a model in transfer learning
608 Views Asked by Ankita Prasad At
1
There are 1 best solutions below
Related Questions in DEEP-LEARNING
- Influence of Unused FFN on Model Accuracy in PyTorch
- How to train a model with CSV files of multiple patients?
- Does tensorflow have a way of calculating input importance for simple neural networks
- What is the alternative to module: tf.keras.preprocessing?
- Which library can replace causal_conv1d in machine learning programming?
- My MSE and MAE are low, but my R2 is not good, how to improve it?
- Sketch Guided Text to Image Generation
- ValueError: The shape of the target variable and the shape of the target value in `variable.assign(value)` must match
- a problem for save and load a pytorch model
- Optuna Hyperband Algorithm Not Following Expected Model Training Scheme
- How can I resolve this error and work smoothly in deep learning?
- Difference between model.evaluate and metrics.accuracy_score
- Integrating Mesonet algorithm with a webUI for deepfake detection model
- How can i edit the "wake-word-detection notebook" on coursera so it fit my own word?
- PyTorch training on M2 GPU slower than Colab CPU
Related Questions in DATA-SCIENCE
- KEDRO - How to specify an arbitrary binary file in catalog.yml?
- Struggling to set up a sparse matrix problem to complete data analysis
- How do I remove slashes and copy the values into many other rows in pandas?
- Downloading full records from Entrez
- Error While calling "from haystack.document_stores import ElasticsearchDocumentStore"
- How to plot time series from 2 columns (Date and Value) by Python google colab?
- How to separate Hijri (Arabic) and Gregorian date ranges from on column to separate columns
- How to wait the fully download of a file with selenium(firefox) in python
- Survey that collects anonymous results, but tracks which recipient have responded
- Dataframe isin function Buffer was wrong number of dimensions error
- How to add different colours in an Altair grouped bar chart in python?
- Python Sorting list of dictionaries with nested list
- Float Division by Zero Error with Function Telling Greatest Power of a Number Dividing Another Number
- If a row contains at least two not NaN values, split the row into two separate ones
- DATA_SOURCE_NOT_FOUND Failed to find data source: mlflow-experiment. Please find packages at `https://spark.apache.org/third-party-projects.html
Related Questions in CONV-NEURAL-NETWORK
- The meaning of an out_channel in nn.Conv2d pytorch
- Difference between model.evaluate and metrics.accuracy_score
- PyTorch training on M2 GPU slower than Colab CPU
- LeNet5 & Self-Organizing Maps - RuntimeError: Trying to backward through the graph a second time - PyTorch
- How I resolve : InvalidArgumentError: Graph execution error?
- Ensemble Learning
- Implementing tflite quantized inference in python
- how to set a custom prior for Convolution2DReparameterization?
- this code is not working on tensorflow 2.16.0+ version
- How to cut out a fragment of an image in a contour
- Can I treat CNN channels separately to make placement predictions?
- Image channel error while training CNN model
- What is the size difference between Convolutional Neural Networks that take as input an RGB vs. monochrome image?
- Struggling to put my input images in the right format for CNN (tensorflow)
- Siamese Network Backpropagation
Related Questions in RESNET
- Simple ResNet model can not tell if two monotone images are the same color
- ValueError: Invalid dtype: object in Tensorflow model.fit
- Resnet50 for binary classification predicts all the images to be of the same class
- Heat map of an image using ReaNet
- Is it possible to extract a intermediate layer of ResNet and make feature map which will be used as input of Faster RCNN?
- How do I combine multiple video inputs into a (2+1)D ResNet
- FineTuning Resnet50 for a binary classification problem and I got different validation accuracy using classification_report and model.evaluate
- How to load a batch of images of and split them into patches on the fly with PyTorch>
- Encoder - Decoder neural network architecture with different input and output size
- Error on AWS Lambda when using deoldify/ColorizeArtistic_gen
- Data Normalisation in transformation then Batch Normalisation in ResNet50 pytorch
- Converting a PNG image to a np.array while running the train method in a neural network
- Error with loading state_dict for ResnetEncoder
- Why does ResNet101 have less accuracy than ResNet50 in classification of sport-celebrity dataset?
- ValueError: Shapes (None, 8) and (None, 7) are incompatible for Eye Disease Recognition System
Related Questions in TRANSFER-LEARNING
- Wandb-Error : AttributeError: can't set attribute 'model'
- ValueError: Dimensions must be equal ResNet-50 Transfer Learning TF
- ValueError: The layer sequential has never been called and thus has no defined output
- Integrating Target Variable with Features Before Splitting Sequences Causes Shape Mismatch in ML Model Input
- MusicVAE fine-tune pre-trained model?
- FineTuning Resnet50 for a binary classification problem and I got different validation accuracy using classification_report and model.evaluate
- Transfer learning on sequential model with features and labels TENSORFLOW
- Are you able to use weight generated by a different framework for transfer learning in keras?
- How to implement Transfer Learning by re-training partially a certain layer of an LSTM network?
- Transfer learning in Pytorch overfitting only during all stage
- Finetuned YOLOv8 Classifier generates error during prediction
- Issue doing transfer learning on YOLOv8
- Efficiently Training and Evaluating Multiple FastAI Models using from_df() dataloader
- retraining tensorflow and model definition of dense layers: why x is not overridden?
- Pytorch RuntimeError: Error(s) in loading state_dict and Missing key(s) in state_dict
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Apparently, this is one of the very good cases where feature engineering comes to the light. Try as much as you can to make your images use less computational power by apply some classic feature engineering methods. Using MTCNN is a really good approach and I would deffo look into Feature Engineering, also here is a link that could provide you some AHAA moments performace wise. All the best