Enhancing Multi-Output AIS Prediction with Indirect Sea Level Referencing: Feature Augmentation for Improved Accuracy in Korean Coastal Waters

Article information

J Navig Port Res. 2025;49(1):18-35
Publication date (electronic) : 2025 February 28
doi : https://doi.org/10.5394/KINPR.2025.49.1.18
*Student, Graduate School of Data Science, Seoul National University, Gwanak-gu, Seoul, Korea
**Professor, Graduate School of Data Science, Seoul National University, Gwanak-gu, Seoul, Korea
***Principal researcher, Korea Research Institute of Ships and Ocean Engineering, 32 Yuseong-daero, Yuseong-gu, Daejeon, Korea
Senior researcher, Korea Research Institute of Ships and Ocean Engineering, 32 Yuseong-daero, Yuseong-gu, Daejeon, Korea
Corresponding author, weelon@kriso.re.kr 042)866-3718
Received 2024 October 17; Revised 2024 October 21; Accepted 2025 January 20.

Abstract

This study introduced a novel methodology for enhancing Automatic Identification System (AIS) trajectory forecasting in regions characterized by significant tidal variations through feature augmentation, specifically indirect incorporation of sea level data via the nearest tidal gauge. Traditional AIS prediction models predominantly utilize features such as latitude, longitude, speed over ground (SOG), and course over ground (COG) for time series forecasting. However, these models often overlook the influence of tidal fluctuations, which can significantly impact prediction accuracy in areas with pronounced tidal changes. To address this limitation, we proposed a feature augmentation approach by incorporating the Haversine distance to the nearest tidal gauge and the real-time sea level at that gauge as additional features. Direct access to sea level data at a vessel’s precise location presents practical challenges, making this indirect method an efficient and effective solution. Through comprehensive analyses across multiple deep learning models and test scenarios, our results demonstrate that this augmented feature set can substantially improve AIS forecasting performance in regions with significant tidal variation surrounding the Korean Peninsula.

1. Introduction

The accelerating rise in sea levels, driven by factors such as global warming and the potential collapse of the Atlantic Meridional Overturning Circulation (AMOC) (van Westen et al., 2024), poses significant challenges for maritime navigation and vessel route prediction. The dynamic nature of these changes is reshaping coastal topographies and tidal patterns, making accurate predictions of vessel movements near coastlines more difficult. Traditionally, predictive models for vessel movement prediction have relied on historical AIS data, focusing on vessel trajectories, speed, and heading to forecast future movements (Li et al., 2023). However, these models often overlook real-time environmental conditions such as tidal fluctuations, which can lead to inaccuracies, especially in regions with significant tidal changes.

Incorporating real-time sea level data into vessel route prediction models is critical for improving accuracy, particularly in coastal regions where tidal variations are prominent. This is especially relevant for maritime environments like those around the Korean Peninsula, where both long-term sea level trends and short-term tidal effects create volatile conditions (Tebaldi et al., 2021). However, direct access to real-time sea level measurements at precise vessel locations is often challenging due to the lack of high-resolution data. Organizations like the International Hydrographic Organization (IHO) provide data, such as S-104 water level information, for surface navigation (Stewart et al., 2019). However, access to and availability of this data is limited. Besides the data provided is mostly not high-resolution sea level information at the exact location of the vessel, but rather sea level data segmented into grids over the region, offering only low-resolution information for each grid, which poses a significant limitation.

This study introduces an indirect sea level referencing method, which integrates real-time sea level data from the nearest tidal gauge and incorporates the Haversine distance to these gauges. By doing so, the prediction accuracy of vessel routes in coastal areas with fluctuating tides is enhanced. The waters surrounding the Korean Peninsula, with their distinct maritime environments, offer an ideal context for this investigation. Major ports such as Busan and Incheon, situated in areas with varying tidal ranges, underscore the necessity for more precise and reliable predictive models (Kim et al., 2024).

2. Literature Review

AIS-based vessel route prediction has traditionally relied on historical trajectory data. Earlier studies applied statistical and machine learning models, such as the Extended Kalman Filter (EKF) (Perera et al., 2010), k-means clustering with Artificial Neural Networks (Gan et al., 2016), and k-Nearest Neighbors (kNN) classification (Duca et al., 2017) to improve prediction accuracy.

More recently, deep learning models like Recurrent Neural Networks (RNNs) (Capobianco et al., 2021) have shown greater effectiveness in open-sea environments. Further advancements, including Long Short-Term Memory (LSTM) networks (Tang et al., 2019), LSTM encoder-decoder architectures (Nguyen et al., 2018), Bidirectional LSTM (Bi-LSTM) (Gao et al., 2018), Bidirectional Gated Recurrent Units (Bi-GRU) (Wang et al., 2020), and GRU encoder-decoder frameworks (You et al., 2020), have improved predictive performance by capturing long-range dependencies in time series data. Recently, Transformer models have been adopted to enhance AIS predictions by utilizing novel data representations and loss functions (Nguyen et al., 2024).

Despite advances in modeling, many studies still rely on conventional AIS features and overlook dynamic environmental factors, such as sea level variations, particularly in coastal areas (Tebaldi et al., 2021). Real-time environmental data, especially sea level measurements, can enhance AIS-based predictions, but availability and integration remain challenging. High-resolution sea level data require extensive coastal infrastructure, such as tidal gauges (Stewart et al., 2019), and integrating these data into models is further complicated by the variability of coastal environments where tidal ranges vary significantly (Kantha et al., 1996).

Some recent research has integrated environmental factors like sea level and tidal variations into deep learning models for improved predictions. Kim et al. (2024) developed a U-Net model for surface sediment classification in tidal flats using UAV images, highlighting the importance of tidal channel density in improving accuracy. Shahabi and Tahvildari (2024) introduced a CNN-LSTM model for predicting coastal water levels in the Chesapeake Bay, combining wind data and astronomical tides. Their model achieved accuracy comparable to hydrodynamic models, demonstrating the effectiveness of deep learning in managing coastal water levels and flood risks.

Additionally, Guo et al. (2024) investigated the use of environmental factors like wind and tides to optimize the navigation of unmanned surface vehicles (USVs). Kantha et al. (1996) examined the shallow water tides around Korea, emphasizing the significant impact of tidal variations, particularly in the Yellow Sea, where the M4 tide can reach amplitudes of up to 10cm near the coast. Their work highlighted the importance of high-resolution models for accurate tidal predictions in coastal regions and the challenges of forecasting sea levels in such dynamic environments.

3. Definitions and Problem Statements

3.1 Definitions

3.1.1 Ship Trajectory

A ship trajectory, denoted as Traj, is a sequence of 120 time-stamped points, which represents a fixed-length window covering both past and future vessel movements at 10-minute intervals. This window size of 120 was chosen to capture 5 hours of historical data (60 points) and 5 hours of future predictions (60 points), allowing the model to make accurate forecasts based on the previous trajectory:

(1) Traj={Po1,,Poi,,Po120}

Each point PoiT is characterized by a tuple that includes various features depending on the model being used (Table 1). The basic features consist of Latitude (LA) and Longitude (LO), which represent the vessel’s geographical coordinates at each time step. Additionally, Speed Over Ground (SOG) captures the vessel’s velocity relative to the Earth’s surface, while Course Over Ground (COG) represents the actual direction in which the vessel is moving. To further capture vessel movement dynamics, the differences in latitude (ΔLA) and longitude (ΔLO) between consecutive points are included, helping the model better understand directional changes over time.

Table of features

In more advanced models, additional features are incorporated. The Sea Level feature, for example, introduces tidal data collected from the nearest gauge, reflecting the water level at the vessel’s location. This feature is particularly important for understanding how fluctuations in sea level can affect vessel movement. Another key feature is the Distance, which represents the Haversine distance between the vessel’s position and the nearest tidal gauge. This provides spatial context to the sea level data by measuring how far the vessel is from the location where tidal information is collected. The inclusion of these features in more complex models allows for more accurate predictions by accounting for environmental factors that influence vessel trajectories.

The Haversine distance d between the vessel’s position (LA1,LO1) and the tidal gauge’s position (LA2,LO2) is calculated as follows:

(2) a=sin2(Δρ2)+cos(ρ1)cos(ρ2)sin2(Δλ2)
(3) d=2Rarcsin(a), where R=6371 km

Here, ρ1 and ρ2 represents the latitudes of the vessel and the gauge, and ∆ρ and ∆λ denote the differences in latitude and longitude, respectively.

3.1.2 Ship Trajectory Dataset

The dataset of ship trajectories is defined as:

(4) TD={Traj1,,Trajj,,Trajn}

where j=1, …, n denotes the number of trajectories.

3.2 Problem Generation and Statements

The geographical complexity of the Korean Peninsula, bordered by three distinct seas with varying tidal ranges, presents unique challenges for accurate ship trajectory prediction, as shown in Fig. 1. The dots in the map of Fig. 1 directly reflect the data represented in Fig. 2, where the sea level variations for each tidal gauge are plotted.

Fig. 1.

Tidal gauges along coastlines in Korea

Fig. 2.

Sea level range at tidal gauges

The East Sea shows relatively stable sea level variations within 100cm, the South Sea experiences more significant fluctuations, ranging from 100 to 500cm, and the West Sea has even greater variations, from 500 to 900cm. These three distinct regions, separated based on sea level differences, were segmented using a one-dimensional clustering analysis, which identified significant changes in sea level behavior, particularly around the 100cm and 500cm marks, and these values were used as the boundaries for segmentation in Fig. 2.

Among these regions, the South Sea poses the most complex challenges due to its strong currents, numerous islands, and substantial tidal changes. Additionally, as ships approach coastal areas, especially near ports, the increased traffic density further complicates trajectory predictions.

4. Experiments and Discussion

The prediction tasks in this study were conducted using six time-series deep learning models: RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, and Transformer. The dataset comprised AIS data collected over a six-month period from the waters surrounding the Korean Peninsula, along with concurrent sea level data obtained from tidal gauges positioned across various coastal regions nationwide. The performance of the models was evaluated on the test data using six different metrics. Fig. 3 illustrates the overall flow of this study in a single diagram.

Fig. 3.

Flow chart of the experiment

4.1 Dataset and Experimental Settings

4.1.1 Dataset

The dataset employed in this study consists of two primary components. The first component is the Automatic Identification System (AIS) data, which encompasses AIS records for cargo vessels over a six-month period, from June 1, 2022, to November 30, 2022. The defined Region of Interest (ROI) is a rectangular area bounded by the coordinates (31.0°N to 39.0°N) in latitude and (124.0°E to 132.0°E) in longitude.

Overview of AIS Dataset

Prior to preprocessing, the dataset comprised 9,546 unique vessel identifiers. AIS messages were aggregated by vessel ID to form continuous trajectories. To account for instances of anchorage or data loss, any trajectory with a data gap exceeding one hour was segmented into separate trajectories. The mean trajectory duration was 454.2 minutes, and the dataset initially contained 216,517 trajectories. Fig. 4 presents a sample of the indirect sea level referencing approach by illustrating the AIS trajectory in conjunction with data from the nearest tidal gauge and the distance to the gauge.

Fig. 4.

AIS sequence combined with sea level and proximity data before split into fixed length of windows

Additionally, the sea level data influenced by tidal forces was sourced from the public Smart Tidal Forecasting system provided by the Korea Hydrographic and Oceanographic Agency. This data, collected from June 1, 2022, to November 30, 2022, aligns precisely with the AIS trajectory data, capturing sea level measurements at one-minute intervals from tidal gauges across the country. Fig. 2 illustrates the classification of tidal gauges based on sea level variation into three categories, represented by different colors. A total of 129 tidal gauges were utilized during the study period, with 22 gauges showing a sea level variation of less than 100cm (blue), 68 gauges with variations up to 500cm (green), and 39 gauges with variations exceeding 500cm (red). This classification naturally reflects the geographical distinctions between the East Sea, South Sea, and West Sea, corresponding to the varying tidal conditions observed in these regions.

4.1.2 Data Pre-processing

The AIS data underwent several preprocessing steps to ensure data quality and consistency for training. Given that AIS data is collected at irregular time intervals across various vessels, the data was first resampled to a uniform 1-minute interval. This resampling process standardized the data and ensured consistency across all trajectories, a crucial step for accurate time-series prediction. To further refine the dataset, several filtering criteria were applied.

Trajectories from vessels with a gross tonnage below 30 tons were removed, as these do not represent typical cargo ship movements and could introduce noise into the predictions.

Any trajectory where the Speed Over Ground (SOG) was less than 3 knots or greater than 30 knots was excluded to eliminate unrealistic vessel speeds that might represent sensor errors or anomalies.

For each trajectory, the total distance traveled between the first and last point was calculated using the Haversine formula. If this distance was less than 35 nautical miles, the trajectory was excluded, as such movements likely represent stationary ships or those performing local maneuvers like docking or circling.

Any trajectory shorter than 600 minutes was excluded to maintain consistency in the length of the trajectories used for training. For trajectories longer than 600 minutes, a sliding window approach was applied. The trajectory was split into 600-minute segments, with a 60-minute step size to minimize data loss and capture multiple subsections of longer journeys.

After the initial filtering, the data was downsampled to a 10-minute interval to balance model performance and computational efficiency. The new resampling interval maintained essential temporal trends while reducing the dataset size for smoother training.

To capture vessel movement directionality, the changes in latitude (ΔLA) and longitude (ΔLO) between consecutive points were calculated using the backward filling method, ensuring the data remained consistent without introducing gaps.

As a result, the numeric features used for training, including latitude (LA), longitude (LO), speed over ground (SOG), course over ground (COG), change in latitude (ΔLA), change in longitude (ΔLO), sea level, and distance to the nearest tidal gauge, were all normalized using min-max scaling. This ensured that all feature values were within the 0-1 range, facilitating efficient and stable model training. The scaler was fit exclusively on the training data to prevent any data leakage. Then to focus on relevant test cases, trajectories within a 20-nautical-mile radius of the nearest tidal gauge were selected. These trajectories were categorized into nine distinct test cases based on sea level variations, with intervals of 100cm ranging from 0 to 900cm, as illustrated in Fig. 5.

Fig. 5.

Visualization of AIS split sequences by tidal differences at ranges of 100cm

4.1.3 Training, Validation, and Hyperparameter Settings

The training and validation process adhered to a Holdout Validation strategy, where the dataset was divided into 70% for training, 15% for validation, and 15% for testing. Specifically, trajectories starting between June 1, 2022, and September 30, 2022, were allocated for training (39,202 trajectories), October 1, 2022, to October 31, 2022, was used as the validation period (9,013 trajectories), and November 1, 2022, to November 30, 2022, comprised the test set (8,641 trajectories).

During validation, each model’s performance was monitered to identify the best model configuration by evaluating validation loss. The best-performing model on the validation set was then saved and used for subsequent testing on the unseen test set. This Holdout Validation approach was chosen over cross-validation due to the ample size and density of the dataset, making holdout validation sufficient for capturing the model’s generalizability. By reserving a stable test set for final evaluation, the methodology ensures that the performance metrics accurately reflect the model’s ability to generalize to new data without the potential variability introduced by cross-validation.

The six deep learning models evaluated in this study — RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, and Transformer — were implemented using the PyTorch framework. During training, the Adam optimizer was employed for parameter updates, known for its ability to handle sparse gradients and non-stationary objectives. Adam adapts the learning rate for each parameter individually by estimating the first and second moments of the gradients, thereby improving optimization efficiency, particularly in complex objective functions.

Early stopping was used as a key mechanism to prevent overfitting. Training was terminated if the validation loss did not improve for 10 consecutive epochs, ensuring that the models did not continue training once they reached a performance plateau on unseen data. Furthermore, dropout was applied at a rate of 0.1 during training to regularize the models by randomly deactivating neurons, helping prevent over-reliance on specific features and promoting generalization.

The hyperparameters employed across all models are summarized in Table 3, while the experimental environment is described in Table 4. These configurations were consistently applied to all models, ensuring a fair comparison of performance across different architectures.

Relevant hyperparameter settings

Experimental environment

4.2 Evaluation Metrics

This study employs six evaluation metrics, as utilized in Li et al. (2023), to rigorously assess the trajectory prediction performance of six prediction methods from both global and local perspectives. These metrics — Mean Squared Error (MSE), Mean Absolute Error (MAE), Symmetric Mean Absolute Percentage Error (SMAPE), Final Displacement Error (FDE), Final Displacement (FD), and Average Euclidean Distance (AED) — offer a comprehensive evaluation framework that captures different dimensions of model accuracy and error distribution.

MSE penalizes larger errors more heavily by squaring the differences between predicted and actual values, making it particularly sensitive to outliers. MAE, on the other hand, calculates the average of absolute errors, providing a balanced measure of accuracy by treating all errors equally, regardless of size. SMAPE offers a scale-independent view of prediction accuracy by using a percentage-based error metric, making it particularly useful for comparing trajectories of different magnitudes. FDE focuses on the accuracy of the final point in the trajectory by measuring the Euclidean distance between the predicted and actual final positions. FD assesses the overall accuracy of the entire trajectory by capturing the total displacement error from start to finish. Finally, AED provides a holistic measure of accuracy across the full trajectory by calculating the average Euclidean distance between the predicted and actual positions at each time step. The specific formulas are defined as follows:

(5) MSE=1ni=1n(yi^yi)2
(6) MAE=1ni=1n|yi^yi|
(7) SMAPE=1ni=1n|yi^yi|(|yi^||yi|)/2
(8) FDE=(yn^yn)2
(9) FD=maxi[1,n]di
(10) AED=1ni=1ndi
(11) di=(yi^( lat )yi( lat ))2+(yi^( lon )yi( lon ))2

where yi^ and yi represent the predicted and actual values for the i-th point, respectively. yi^(LA) and yi^(LO) denote the predicted latitude and longitude while yi(LA) and yi(LO) denote the corresponding actual latitude and longitude for the i-th trajectory point. Here, di represents the Euclidean distance between the predicted and actual positions at the i-th point and n is the total number of trajectory points in the test samples.

4.3 Experiment Results

4.3.1 Overall Results

This section presents the experimental results of various deep learning models in a multi-output prediction framework, predicting the next 5 hours of ship trajectories based on the previous 5 hours of data. Figures 6 through 14 depict the prediction outcomes of the selected models with their optimal feature configurations: RNN and Bi-GRU with the Basic feature set, LSTM and GRU with the Distance feature set, and Bi-LSTM and Transformer with the Extended feature set. Across all test cases, the Transformer model consistently demonstrated superior performance, closely aligning its predictions with the actual routes.

Fig. 6.

Prediction visualization in 0-100cm range

Fig. 7.

Prediction visualization in 100-200cm range

Fig. 8.

Prediction visualization in 200-300cm range

Fig. 9.

Prediction visualization in 300-400cm range

Fig. 10.

Prediction visualization in 400-500cm range

Fig. 11.

Prediction visualization in 500-600cm range

Fig. 12.

Prediction visualization in 600-700cm range

Fig. 13.

Prediction visualization in 700-800cm range

Fig. 14.

Prediction visualization in 800-900cm range

Figures 15 through 20 and Tables 5 through 10 provide the detailed performance metrics obtained through this evaluation process, offering a quantitative perspective on each model’s accuracy across different sea level conditions. The models were trained on four months of AIS data from June to September, validated on October data, and evaluated using the November data as the test set. The test set was divided into specific test cases to measure model performance under varying conditions, with metrics obtained for each case based on the predictions generated by the trained models.

Fig. 15.

Prediction results with RNN model

Fig. 16.

Prediction results with LSTM model

Fig. 17.

Prediction results with GRU model

Fig. 18.

Prediction results with Bi-LSTM model

Fig. 19.

Prediction results with Bi-GRU

Prediction values with RNN

In particular, Fig 8 illustrates a scenario where the Transformer model’s predictions were almost indistinguishable from the actual route, while other models exhibited significant deviations. This demonstrates the effectiveness of combining the multi-output prediction framework with feature augmentation, particularly with sea level data and Haversine distance to tidal gauges. The Extended feature configuration of the Transformer was notably robust, excelling in complex environments like the South Sea, which is characterized by strong tidal variations and intricate coastlines.

These results underscore the importance of incorporating additional environmental features, such as sea level, into trajectory prediction tasks. The Transformer model's superior performance across multiple tidal regions highlights the advantage of using augmented features, especially in maritime environments with complex tidal dynamics.

4.3.2 Model-Specific Analysis

The detailed analysis of each model focused on six key metrics. Below, the performance of each model is discussed with an emphasis on the impact of feature augmentation and how the models responded to different sea level ranges.

The RNN model (Figure 15, Table 5) struggled to fully leverage the Extended features. In the 0-100cm and 600-700cm sea level ranges, the Basic feature set outperformed the Extended set in all metrics, indicating that adding sea level and distance features did not improve short-term predictions. Similarly, in the 100-200cm and 700-800cm ranges, the Basic model performed better across most metrics. In the 800-900cm range, the Extended features led to improved performance in all metrics except FD, suggesting that while additional features helped in highly dynamic environments, challenges remained in predicting the final destination.

The LSTM model (Figure 16, Table 6) demonstrated moderate improvements over RNN, especially in low sea level variation environments. In the 0-100cm range, Basic features outperformed Extended features in all metrics except FDE. As sea level variations increased, the Extended features generally led to better performance across most metrics, particularly in the 100-500cm range. However, similar to RNN, FD remained lower with Basic features in the ranges of 600cm and above, indicating a difficulty in maintaining final destination accuracy with Extended features in extreme conditions.

Prediction values with LSTM

The GRU model (Figure 17, Table 7) showed more substantial gains with the Extended features, particularly in sea level ranges above 500cm. Metrics like RMSE, MAE, and SMAPE consistently showed better performance with the Extended set. In the 800-900cm range, however, the FD metric increased for the Extended model, indicating a trade-off: while the overall trajectory accuracy improved, FD suffered. This suggests that additional features helped in short- and medium-term predictions, but introduced complexities in extreme tidal environments.

Prediction values with GRU

For both the Bi-LSTM (Figure 18, Table 8) and Bi-GRU (Figure 19, Table 9) models, lower overall error rates were observed compared to their unidirectional counterparts. Despite improved performance in trajectory prediction, the additional features may have overcomplicated the prediction task in extreme tidal environments. Interestingly, FD values for these models did not degrade as significantly, suggesting that while the overall trajectory predictions struggled, the final displacement remained stable.

Prediction values with Bi-LSTM

Prediction values with Bi-GRU

The Transformer model (Figure 20, Table 10) outperformed all other models across all sea level ranges, with significantly lower error rates in all of the metrics. However, in the 800-900cm range, the Transformer model exhibited increased errors in FDE, FD, and AED with the Extended features. This suggests that in that specific range while the Transformer was highly effective at short- to medium-term predictions, it struggled with predicting the final destination in extreme tidal environments. The increased errors in FDE and FD indicate a gap between trajectory accuracy and final displacement accuracy, especially in dynamic environments.

Fig. 20.

Prediction results with Transformer

Prediction values with Transformer

Recent research in time series forecasting supports these findings. Tang and Matteson (2021) demonstrated that Transformer architectures, particularly when combined with probabilistic modeling, excel in handling long-range dependencies and non-deterministic dynamics in time series data. Their probabilistic Transformer model provided better long-term forecasts while also capturing uncertainty, which aligns with the performance observed in this study. However, the challenges in predicting final destination accuracy in extreme conditions suggest that further refinement is needed.

4.3.3 Sea Level Effect

A clear trend emerges across all models: as the range of sea level differences increases, prediction errors rise. This highlights the strong influence of sea level variations on ship trajectory prediction, particularly in areas with significant tidal differences.

In the East Sea (0-100cm), models generally exhibited lower error rates. The relatively stable sea level and straightforward coastline contributed to easier predictions, limiting the impact of feature augmentation. For instance, the Transformer model showed only marginal improvements with the Extended features in this region.

In contrast, the South Sea (100-500cm) presented a more challenging environment, characterized by significant tidal changes and a complex coastline. In this region, the Transformer model’s Extended configuration significantly outperformed the Basic model, demonstrating the value of incorporating sea level data and distance to tidal gauges. These features helped capture the dynamic tidal variations, leading to better performance.

In the West Sea (500-900cm), which experiences extreme tidal variations, the Transformer model continued to outperform the Basic model, though the improvement rate diminished compared to the South Sea. In the 800-900cm range, the Extended model even underperformed in some cases, particularly with the Transformer and Bi-GRU models. One possible explanation is that the 10-hour prediction window extends beyond regions where sea level impacts vessel movement, reducing the effectiveness of the additional features.

Overall, the results demonstrate that sea level variations significantly impact ship trajectory prediction. The Extended model’s inclusion of sea level data and Haversine distance to the nearest tidal gauge proves particularly effective in regions with moderate tidal variations, but faces limitations in extreme environments like the West Sea.

5. Conclusion

This study conducted experiments leveraging the prominent sea level differences observed along the coasts of Korea’s East, West, and South Seas to demonstrate the significance of an indirect sea level referencing approach. Recognizing the challenge of obtaining real-time sea level data at a vessel’s current location, this method references sea level data from the nearest tidal gauge, augmented by the Haversine distance to the gauge as an additional feature. This indirect referencing method consistently outperformed basic feature models across most of the deep learning models, showing improvements in key metrics and test cases. Notably, the Transformer model exhibited superior performance, particularly benefiting from feature augmentation, which effectively captured long-range dependencies in time series forecasting.

While the simulations covered a broad range of tidal conditions, the study primarily focused on sea level as an environmental factor. To enhance future research, the inclusion of additional environmental variables, such as wind and wave data, could offer further insights and refine model predictions in more dynamic coastal conditions. Incorporating these factors would provide a more comprehensive understanding of the various influences on ship trajectory prediction, particularly in rapidly changing maritime environments.

Additionally, this study adopted a multi-output approach, where the past 5 hours of trajectory data were used to predict the next 5 hours. As a potential direction for future work, research could explore a multi-step prediction method, enabling the model to begin predictions based on shorter trajectories and dynamically generate forecasts over more frequent intervals. This would enhance the flexibility and applicability of the models in real-world navigation, particularly in time-sensitive scenarios.

References

[1]. Capobianco S., Millefiori L. M., Forti N., Braca P., Willett P.. 2021;Deep Learning Methods for Vessel Trajectory Prediction Based on Recurrent Neural Networks. IEEE Transactions on Aerospace and Electronic Systems 57(6):4329–4346.
[2]. Duca A. L., Bacciu C., Marchetti A.. 2017;A K-nearest neighbor classifier for ship route prediction. in Proceedings of OCEANS-Aberdeen :1–6.
[3]. Gao M., Shi G., Li S.. 2018;Online prediction of ship behavior with automatic identification system sensor data using bidirectional long short-term memory recurrent neural network. Sensors 18(12):4211.
[4]. Gan S., Liang S., Li K., Deng J., Cheng T.. 2016;Ship trajectory prediction for intelligent traffic management using clustering and ANN. in Proceedings of the UKACC 11th International Conference on Control :1–6.
[5]. Giuliari F., Hasan I., Cristani M., Galasso F.. 2021;Transformer Networks for Trajectory Forecasting. in 2020 25th International Conference on Pattern Recognition (ICPR) :10335–10342, IEEE.
[6]. Guo S., Wang H., Wang S.. 2024;Managing Port Disruption Through Sailing Speed Optimization for Sustainable Maritime Transportation. Cleaner Logistics and Supply Chain 11:100153.
[7]. Kantha L. H., Bang I. K., Choi J. K., Suk M. S.. 1996;Shallow Water Tides in the Seas around Korea. Journal of the Korean Society of Oceanography 31(3):123–133.
[8]. Kim H. C., Son W. J., Lee J. S., Cho I. S.. 2024;Identification of Maritime Areas With High Vessel Traffic Based on Polygon Shape Similarity. IEEE Access 12:92253–92267.
[9]. Kim K. L., Woo H. J., Jou H. T., Jung H. C., Lee S. K., Ryu J. H.. 2024;Surface sediment classification using a deep learning model and unmanned aerial vehicle data of tidal flats. Marine Pollution Bulletin 198:115823.
[10]. Li H., Jiao H., Yang Z.. 2023;AIS data-driven ship trajectory prediction modelling and analysis based on machine learning and deep learning methods. Transportation Research Part E: Logistics and Transportation Review 175:103152.
[11]. Nguyen D., Fablet R.. 2024;A Transformer Network with Sparse Augmented Data Representation and Cross Entropy Loss for AIS-Based Vessel Trajectory Prediction. IEEE Access
[12]. Nguyen D. D., Le Van C., Ali M. I.. 2018;Vessel trajectory prediction using sequence-to-sequence models over spatial grid. in Proceedings of the 12th ACM International Conference on Distributed Event-Based Systems :258–261.
[13]. Perera L. P., Soares C. G.. 2010;Ocean vessel trajectory estimation and prediction based on extended Kalman filter. in Proceedings of the 2nd International Conference on Adaptive and Self-Adaptive Systems and Applications :14–20.
[14]. Shahabi A., Tahvildari N.. 2024;A deep-learning model for rapid spatiotemporal prediction of coastal water levels. Coastal Engineering 190:104504.
[15]. Souhaib Ben Taieb, Antti Sorjamaa, Gianluca Bontempi. 2010;Multiple-output modeling for multi-step-ahead time series forecasting. Neurocomputing 73(10-12):1950–1957. ISSN 0925-2312.
[16]. Stewart H. A., Jamieson A. J.. 2019;The Five Deeps: The Location and Depth of the Deepest Place in Each of the World’s Oceans. Earth-Science Reviews 197:102896.
[17]. Tang B., Matteson D. S.. 2021;Probabilistic Transformer for Time Series Analysis. Advances in Neural Information Processing Systems 34:18102–18113.
[18]. Tang H., Yin Y., Shen H.. 2019;A model for vessel trajectory prediction based on long short-term memory neural network. Journal of Marine Engineering and Technology 21(3):136–145.
[19]. Tebaldi C., Ranasinghe R., Vousdoukas M., Rasmussen D. J., Vega-Westhoff B., Kirezci E., Kopp R. E., Sriver R., Mentaschi L.. 2021;Extreme sea levels at different global warming levels. Nature Climate Change 11:746–751.
[20]. van Westen R. M., Kliphuis M., Dijkstra H. A.. 2024;Physics-based early-warning signal shows that AMOC is on a tipping course. Science Advances 10(6):eadk1189.
[21]. Wang Y., Lu C., Wu P., Zhang X.. 2024;Path planning for unmanned surface vehicle based on improved Q-Learning algorithm. Ocean Engineering 292:116510.
[22]. Wang C., Ren H., Li H.. 2020;Vessel trajectory prediction based on AIS data and bidirectional GRU. in Proceedings of the International Conference on Computer Vision, Image and Deep Learning (CVIDL) :260–264.
[23]. You L., Xiao S., Peng Q.. 2020;ST-Seq2Seq: A spatio-temporal feature-optimized Seq2Seq model for short-term vessel trajectory prediction. IEEE Access 8:218565–218574.

Article information Continued

Fig. 1.

Tidal gauges along coastlines in Korea

Fig. 2.

Sea level range at tidal gauges

Fig. 3.

Flow chart of the experiment

Fig. 4.

AIS sequence combined with sea level and proximity data before split into fixed length of windows

Fig. 5.

Visualization of AIS split sequences by tidal differences at ranges of 100cm

Fig. 6.

Prediction visualization in 0-100cm range

Fig. 7.

Prediction visualization in 100-200cm range

Fig. 8.

Prediction visualization in 200-300cm range

Fig. 9.

Prediction visualization in 300-400cm range

Fig. 10.

Prediction visualization in 400-500cm range

Fig. 11.

Prediction visualization in 500-600cm range

Fig. 12.

Prediction visualization in 600-700cm range

Fig. 13.

Prediction visualization in 700-800cm range

Fig. 14.

Prediction visualization in 800-900cm range

Fig. 15.

Prediction results with RNN model

Fig. 16.

Prediction results with LSTM model

Fig. 17.

Prediction results with GRU model

Fig. 18.

Prediction results with Bi-LSTM model

Fig. 19.

Prediction results with Bi-GRU

Fig. 20.

Prediction results with Transformer

Table 1.

Table of features

Model Feature
Basic LA, LO, SOG, COG, ΔLA, ΔLO
Distance LA, LO, SOG, COG, ΔLA, ΔLO, Distance
Sea Level LA, LO, SOG, COG, ΔLA, ΔLO, Sea Level
Extended LA, LO, SOG, COG, ΔLA, ΔLO, Sea Level, Distance

Table 2.

Overview of AIS Dataset

Category Value
Vessel Count 9,546
AIS Data Period June 1 – Nov 30, 2022
Total Data Points 2,165,170
Trajectory Count 216,517
Processed Trajectory Count 56,856
Average Trajectory Length 454.2 minutes
Vessel Types Cargo
ROI Latitude Range 31.0°N - 39.0°N
ROI Longitude Range 124.0°E - 132.0°E

Table 3.

Relevant hyperparameter settings

Hyperparameter Configure
Learning rate 0.0001
Epoch 30
Dropout 0.1
Hidden size 512
Embedding dimension 128
Input/output dimensions 2
Hidden layer 8

Table 4.

Experimental environment

System Configure
OS Ubuntu 20.04.6 LTS
CPU 2 AMD EPYC 7502 32-Core Processor
@ 2.5GHz (64 cores & 128 processors)
Memory 2TiB (32 64GiB DDR4 RAM @ 3.2GHz)
GPU NVIDIA GeForce RTX 3090

Table 5.

Prediction values with RNN

Case Metric Basic Distance Sea Level Extended
000-100 RMSE 0.013910 0.015318 0.014006 0.014416
MAE 0.008723 0.010072 0.008872 0.008995
SMAPE 0.013199 0.014962 0.013293 0.013502
FDE 0.012676 0.014050 0.013114 0.014831
FD 0.231274 0.240306 0.238566 0.256314
AED 0.013951 0.015963 0.014001 0.014230
100-200 RMSE 0.016372 0.016360 0.016542 0.016433
MAE 0.010119 0.010529 0.010533 0.010983
SMAPE 0.018003 0.018444 0.018802 0.019723
FDE 0.015385 0.015897 0.015793 0.017226
FD 0.217877 0.225124 0.225152 0.219004
AED 0.015730 0.016471 0.016368 0.017060
200-300 RMSE 0.022612 0.023262 0.023896 0.024446
MAE 0.013807 0.013905 0.014791 0.015890
SMAPE 0.027520 0.027366 0.029273 0.031806
FDE 0.022809 0.021341 0.022455 0.025509
FD 0.275489 0.278993 0.308620 0.249092
AED 0.021630 0.021749 0.022857 0.024614
300-400 RMSE 0.017431 0.017293 0.015975 0.017653
MAE 0.011324 0.011172 0.009934 0.011810
SMAPE 0.031658 0.031404 0.027838 0.033049
FDE 0.018332 0.017849 0.017574 0.018563
FD 0.327456 0.323615 0.324519 0.342065
AED 0.017878 0.017562 0.015637 0.018357
400-500 RMSE 0.018750 0.017425 0.016985 0.017363
MAE 0.012865 0.011779 0.011246 0.012134
SMAPE 0.041268 0.036895 0.036859 0.038016
FDE 0.020539 0.019137 0.019200 0.018463
FD 0.221269 0.235841 0.207095 0.215842
AED 0.020389 0.018593 0.017751 0.018873
500-600 RMSE 0.031361 0.030967 0.031023 0.032194
MAE 0.021306 0.019855 0.020639 0.022242
SMAPE 0.070834 0.066706 0.072507 0.075959
FDE 0.028253 0.025627 0.028529 0.029494
FD 0.322267 0.318610 0.309885 0.310831
AED 0.034482 0.032420 0.034425 0.035732
600-700 RMSE 0.022673 0.024311 0.023912 0.026076
MAE 0.015300 0.016948 0.016557 0.018599
SMAPE 0.045906 0.052256 0.052157 0.056437
FDE 0.026653 0.026845 0.028308 0.030026
FD 0.153577 0.158449 0.161474 0.161296
AED 0.024085 0.026692 0.026487 0.029666
700-800 RMSE 0.020893 0.023904 0.023276 0.021907
MAE 0.014010 0.016408 0.015676 0.014062
SMAPE 0.037706 0.047565 0.042993 0.040064
FDE 0.021293 0.025434 0.021946 0.021698
FD 0.218583 0.207562 0.213360 0.220217
AED 0.021323 0.024725 0.023552 0.021304
800-900 RMSE 0.028389 0.027181 0.026204 0.026546
MAE 0.021646 0.021580 0.020115 0.019585
SMAPE 0.058261 0.059517 0.054554 0.053239
FDE 0.035566 0.040459 0.035717 0.034072
FD 0.155085 0.141816 0.142385 0.181831
AED 0.032906 0.032941 0.030640 0.030066

Table 6.

Prediction values with LSTM

Case Metric Basic Distance Sea Level Extended
000-100 RMSE 0.011111 0.011519 0.012207 0.011168
MAE 0.006412 0.006666 0.007643 0.006457
SMAPE 0.009711 0.010061 0.011454 0.009739
FDE 0.009554 0.009457 0.011191 0.009267
FD 0.271613 0.284218 0.280138 0.279438
AED 0.010151 0.010528 0.012027 0.010227
100-200 RMSE 0.014145 0.014410 0.014176 0.013810
MAE 0.008748 0.008470 0.008815 0.008487
SMAPE 0.015481 0.015163 0.015579 0.015011
FDE 0.013361 0.012985 0.013971 0.013048
FD 0.236634 0.255949 0.253268 0.239884
AED 0.013657 0.013157 0.013662 0.013208
200-300 RMSE 0.021333 0.020694 0.020603 0.020017
MAE 0.012246 0.012128 0.011927 0.011523
SMAPE 0.024327 0.024284 0.023745 0.022939
FDE 0.019011 0.019670 0.019634 0.018787
FD 0.293327 0.279829 0.293429 0.268344
AED 0.019116 0.018935 0.018539 0.017945
300-400 RMSE 0.015382 0.015958 0.014627 0.014709
MAE 0.00934 0.010266 0.008097 0.008118
SMAPE 0.026357 0.029029 0.022883 0.022935
FDE 0.015909 0.017161 0.014378 0.014463
FD 0.346227 0.347670 0.351762 0.334487
AED 0.014798 0.016053 0.012698 0.012729
400-500 RMSE 0.016205 0.015913 0.015125 0.014961
MAE 0.010445 0.010475 0.008992 0.009096
SMAPE 0.033297 0.032865 0.028729 0.029337
FDE 0.016428 0.016024 0.014352 0.014477
FD 0.292882 0.282570 0.309750 0.288954
AED 0.016555 0.016475 0.014178 0.014278
500-600 RMSE 0.02977 0.027831 0.029138 0.029586
MAE 0.018162 0.016662 0.017300 0.017843
SMAPE 0.061018 0.055303 0.058833 0.062109
FDE 0.025055 0.022573 0.024011 0.025660
FD 0.338179 0.317018 0.349584 0.316666
AED 0.030010 0.027193 0.028670 0.029949
600-700 RMSE 0.021666 0.021479 0.022241 0.023262
MAE 0.014617 0.014076 0.014263 0.014485
SMAPE 0.043631 0.041505 0.043691 0.044175
FDE 0.023151 0.022908 0.024442 0.024557
FD 0.153455 0.151776 0.173000 0.162266
AED 0.022847 0.022038 0.022720 0.023065
700-800 RMSE 0.018105 0.019078 0.019530 0.019618
MAE 0.011156 0.011705 0.011492 0.011452
SMAPE 0.030037 0.031716 0.030352 0.031376
FDE 0.017904 0.018332 0.018343 0.017075
FD 0.239216 0.238083 0.231992 0.241303
AED 0.016924 0.017875 0.017457 0.017246
800-900 RMSE 0.024441 0.025707 0.024938 0.024020
MAE 0.018673 0.019996 0.018356 0.017782
SMAPE 0.048511 0.050827 0.046942 0.045632
FDE 0.032438 0.036173 0.033144 0.031158
FD 0.119148 0.167245 0.171143 0.135172
AED 0.027923 0.030097 0.027461 0.026737

Table 7.

Prediction values with GRU

Case Metric Basic Distance Sea Level Extended
000-100 RMSE 0.011359 0.012219 0.01168 0.011502
MAE 0.006663 0.007426 0.007102 0.007019
SMAPE 0.010049 0.011217 0.01072 0.010613
FDE 0.009975 0.011972 0.010541 0.011136
FD 0.273006 0.283832 0.270451 0.268829
AED 0.010616 0.011713 0.011291 0.011156
100-200 RMSE 0.01418 0.014078 0.013397 0.014075
MAE 0.009035 0.008789 0.008238 0.008846
SMAPE 0.015973 0.015628 0.014597 0.015708
FDE 0.013516 0.013691 0.012615 0.013246
FD 0.231338 0.252992 0.227385 0.240362
AED 0.014131 0.013672 0.012813 0.013625
200-300 RMSE 0.021452 0.021428 0.02116 0.021925
MAE 0.012572 0.012365 0.011868 0.012335
SMAPE 0.025024 0.024669 0.023591 0.024525
FDE 0.019995 0.019492 0.019881 0.019465
FD 0.286427 0.317762 0.311524 0.308305
AED 0.019601 0.019243 0.018602 0.019117
300-400 RMSE 0.016257 0.015801 0.015192 0.014967
MAE 0.010209 0.009694 0.008626 0.008526
SMAPE 0.028792 0.027531 0.024407 0.024226
FDE 0.016816 0.016299 0.015472 0.015524
FD 0.354195 0.337904 0.364148 0.366931
AED 0.015948 0.015282 0.01347 0.013368
400-500 RMSE 0.016603 0.016121 0.015525 0.014921
MAE 0.011007 0.01046 0.009506 0.008912
SMAPE 0.034848 0.033177 0.029878 0.028259
FDE 0.016723 0.016501 0.015646 0.014644
FD 0.24119 0.27892 0.299215 0.29613
AED 0.017282 0.016456 0.014988 0.014035
500-600 RMSE 0.029497 0.029526 0.027055 0.026967
MAE 0.01886 0.018259 0.01637 0.015868
SMAPE 0.063827 0.061961 0.055709 0.054161
FDE 0.027 0.025067 0.023091 0.023197
FD 0.30281 0.318746 0.301571 0.315975
AED 0.031079 0.029921 0.027332 0.026556
600-700 RMSE 0.023541 0.02266 0.021829 0.022054
MAE 0.015149 0.014869 0.013894 0.013814
SMAPE 0.044482 0.04536 0.041825 0.041276
FDE 0.025488 0.024463 0.023827 0.024751
FD 0.179675 0.157873 0.172212 0.175783
AED 0.023878 0.023435 0.022117 0.022014
700-800 RMSE 0.020973 0.024778 0.018526 0.020472
MAE 0.012609 0.015271 0.010674 0.011754
SMAPE 0.035886 0.041424 0.029085 0.031951
FDE 0.019841 0.021036 0.017842 0.017437
FD 0.258497 0.233135 0.24668 0.258726
AED 0.019176 0.023034 0.016299 0.017803
800-900 RMSE 0.028228 0.027951 0.024085 0.025066
MAE 0.022116 0.020375 0.017769 0.017901
SMAPE 0.059621 0.056663 0.045592 0.047586
FDE 0.041067 0.037864 0.030986 0.032483
FD 0.15374 0.139938 0.150449 0.171777
AED 0.033165 0.030794 0.026732 0.026769

Table 8.

Prediction values with Bi-LSTM

Case Metric Basic Distance Sea Level Extended
000-100 RMSE 0.005662 0.005008 0.004909 0.005231
MAE 0.004263 0.003430 0.003573 0.003721
SMAPE 0.006436 0.005125 0.005371 0.005601
FDE 0.010009 0.009497 0.009214 0.009944
FD 0.104161 0.091710 0.073193 0.080524
AED 0.006744 0.005439 0.005595 0.005805
100-200 RMSE 0.007393 0.007198 0.006904 0.007397
MAE 0.004392 0.004214 0.004127 0.004436
SMAPE 0.008013 0.007571 0.007449 0.008008
FDE 0.013369 0.013646 0.013338 0.013486
FD 0.156434 0.152331 0.127791 0.160269
AED 0.006940 0.006604 0.006494 0.006901
200-300 RMSE 0.009362 0.009477 0.009205 0.009213
MAE 0.005197 0.005244 0.005042 0.004927
SMAPE 0.010524 0.01062 0.010266 0.010014
FDE 0.019071 0.019196 0.019107 0.018350
FD 0.113229 0.133155 0.111403 0.112223
AED 0.008328 0.00842 0.008122 0.007934
300-400 RMSE 0.008603 0.008186 0.008178 0.007903
MAE 0.004449 0.004248 0.003962 0.004038
SMAPE 0.013056 0.012329 0.011484 0.011692
FDE 0.016258 0.015976 0.015000 0.015562
FD 0.339225 0.335208 0.337820 0.334678
AED 0.006933 0.006624 0.006187 0.006286
400-500 RMSE 0.008661 0.009192 0.008330 0.009444
MAE 0.004673 0.004949 0.004309 0.004910
SMAPE 0.014928 0.016019 0.013874 0.016174
FDE 0.015395 0.016515 0.014061 0.016520
FD 0.210139 0.212073 0.212381 0.209049
AED 0.007326 0.007741 0.006723 0.007620
500-600 RMSE 0.018559 0.020774 0.018576 0.020135
MAE 0.008488 0.008615 0.008468 0.008510
SMAPE 0.027462 0.027401 0.027544 0.027562
FDE 0.022493 0.023719 0.023748 0.022471
FD 0.328039 0.344426 0.316284 0.344218
AED 0.014154 0.014292 0.014074 0.014223
600-700 RMSE 0.013534 0.012287 0.013868 0.012337
MAE 0.008193 0.007019 0.007954 0.006847
SMAPE 0.023621 0.020719 0.023448 0.021250
FDE 0.023764 0.020823 0.023675 0.022034
FD 0.130439 0.132312 0.167590 0.131999
AED 0.012981 0.011080 0.012697 0.010952
700-800 RMSE 0.009841 0.009067 0.009570 0.009428
MAE 0.006459 0.005036 0.006134 0.005068
SMAPE 0.016668 0.013526 0.015959 0.013274
FDE 0.019047 0.019467 0.017986 0.018942
FD 0.139839 0.145100 0.184456 0.140695
AED 0.009835 0.007795 0.009285 0.007888
800-900 RMSE 0.014995 0.013934 0.014433 0.017607
MAE 0.010142 0.009406 0.009754 0.012499
SMAPE 0.025882 0.023758 0.024569 0.031705
FDE 0.033893 0.034617 0.032967 0.039922
FD 0.096702 0.079083 0.084179 0.103064
AED 0.015535 0.014537 0.015020 0.019265

Table 9.

Prediction values with Bi-GRU

Case Metric Basic Distance Sea Level Extended
000-100 RMSE 0.004993 0.004740 0.004490 0.004633
MAE 0.003353 0.003216 0.003053 0.003217
SMAPE 0.005018 0.004819 0.004576 0.004881
FDE 0.009591 0.009027 0.008956 0.009062
FD 0.077225 0.094966 0.092101 0.078375
AED 0.005281 0.005078 0.004797 0.005050
100-200 RMSE 0.006857 0.006779 0.006645 0.006493
MAE 0.004061 0.003898 0.003900 0.003848
SMAPE 0.007324 0.007080 0.007014 0.006967
FDE 0.013358 0.013459 0.012632 0.012270
FD 0.125376 0.126693 0.129128 0.134462
AED 0.006376 0.006124 0.006151 0.006005
200-300 RMSE 0.008984 0.009439 0.008875 0.008665
MAE 0.005069 0.005360 0.004999 0.004781
SMAPE 0.010302 0.010879 0.010104 0.009707
FDE 0.018079 0.019431 0.017744 0.017414
FD 0.127260 0.130872 0.112881 0.131384
AED 0.008087 0.008540 0.007979 0.007666
300-400 RMSE 0.008158 0.008159 0.007848 0.008055
MAE 0.004276 0.004115 0.003960 0.004083
SMAPE 0.012384 0.011955 0.011370 0.011878
FDE 0.015664 0.015630 0.014765 0.014816
FD 0.337591 0.335961 0.33354 0.336546
AED 0.006642 0.006404 0.006158 0.006339
400-500 RMSE 0.008903 0.009797 0.009051 0.008380
MAE 0.004835 0.005004 0.004636 0.004383
SMAPE 0.015488 0.016481 0.015193 0.014201
FDE 0.016016 0.016761 0.014987 0.013936
FD 0.212488 0.211006 0.210694 0.211863
AED 0.007524 0.007762 0.007189 0.006835
500-600 RMSE 0.022412 0.019190 0.020820 0.019897
MAE 0.009323 0.008206 0.009123 0.008145
SMAPE 0.029814 0.026224 0.029478 0.026599
FDE 0.025398 0.022112 0.023208 0.022468
FD 0.355244 0.347116 0.362400 0.354639
AED 0.015436 0.013532 0.015070 0.013704
600-700 RMSE 0.013100 0.012180 0.013222 0.012951
MAE 0.007206 0.007003 0.007774 0.007311
SMAPE 0.021765 0.020582 0.022577 0.021640
FDE 0.023802 0.022226 0.022732 0.022596
FD 0.133211 0.128089 0.147263 0.133075
AED 0.011433 0.011033 0.012350 0.011662
700-800 RMSE 0.009113 0.009080 0.009478 0.008350
MAE 0.005030 0.005012 0.005515 0.004799
SMAPE 0.013274 0.013188 0.014151 0.013075
FDE 0.018754 0.018130 0.017797 0.016942
FD 0.160112 0.163896 0.145710 0.110868
AED 0.007782 0.007692 0.008458 0.007422
800-900 RMSE 0.014331 0.014497 0.014513 0.015085
MAE 0.009302 0.008971 0.009800 0.010442
SMAPE 0.022864 0.023508 0.024719 0.027097
FDE 0.032999 0.032014 0.032822 0.034058
FD 0.093492 0.098881 0.094290 0.100450
AED 0.014566 0.013922 0.014934 0.016050

Table 10.

Prediction values with Transformer

Case Metric Basic Distance Sea Level Extended
000-100 RMSE 0.000206 0.000099 0.000135 0.000182
MAE 0.00017 0.000085 0.000096 0.000149
SMAPE 0.00025 0.000128 0.000143 0.000214
FDE 0.000269 0.000142 0.000146 0.000203
FD 0.018805 0.006383 0.02116 0.011079
AED 0.000266 0.000125 0.000147 0.000224
100-200 RMSE 0.000126 0.000086 0.000157 0.000095
MAE 0.000102 0.000073 0.000125 0.000063
SMAPE 0.000186 0.000130 0.000211 0.000100
FDE 0.000198 0.000133 0.000211 0.000109
FD 0.006303 0.002331 0.008183 0.003527
AED 0.000167 0.000110 0.000186 0.000106
200-300 RMSE 0.000138 0.000072 0.000134 0.000064
MAE 0.000104 0.000061 0.000107 0.000044
SMAPE 0.000217 0.000116 0.000199 0.000080
FDE 0.000202 0.000107 0.000193 0.000096
FD 0.006295 0.002349 0.008181 0.003587
AED 0.000179 0.000091 0.000166 0.000072
300-400 RMSE 0.000212 0.000169 0.000166 0.000163
MAE 0.000155 0.000063 0.000086 0.000060
SMAPE 0.000417 0.000200 0.000257 0.000179
FDE 0.000263 0.000122 0.000129 0.000101
FD 0.024361 0.033138 0.041137 0.035230
AED 0.000263 0.000105 0.000135 0.000093
400-500 RMSE 0.000232 0.000132 0.000150 0.000122
MAE 0.000181 0.000089 0.000103 0.000077
SMAPE 0.000485 0.000319 0.000359 0.000263
FDE 0.000261 0.000159 0.000156 0.000142
FD 0.014966 0.012059 0.026451 0.011581
AED 0.000305 0.000146 0.000161 0.000119
500-600 RMSE 0.000221 0.000140 0.000190 0.000112
MAE 0.000163 0.000126 0.000133 0.000083
SMAPE 0.000408 0.000425 0.000456 0.000272
FDE 0.000233 0.000185 0.000189 0.000150
FD 0.007463 0.000803 0.009735 0.004063
AED 0.000278 0.000192 0.000207 0.000129
600-700 RMSE 0.000193 0.000156 0.000159 0.000136
MAE 0.000173 0.000137 0.000141 0.000106
SMAPE 0.000476 0.000442 0.000436 0.000279
FDE 0.000314 0.000211 0.000218 0.000235
FD 0.000549 0.000558 0.000501 0.000659
AED 0.000263 0.000213 0.000217 0.000166
700-800 RMSE 0.000817 0.000406 0.001562 0.000780
MAE 0.000189 0.000143 0.000189 0.000142
SMAPE 0.000529 0.000456 0.000533 0.000339
FDE 0.000297 0.000237 0.000276 0.000251
FD 0.076652 0.036948 0.150303 0.073751
AED 0.000292 0.000227 0.000299 0.000224
800-900 RMSE 0.000228 0.000215 0.000230 0.000269
MAE 0.000195 0.000177 0.000198 0.000189
SMAPE 0.000542 0.000574 0.000527 0.000424
FDE 0.000268 0.000293 0.000312 0.000356
FD 0.001335 0.000879 0.001514 0.001475
AED 0.000293 0.000282 0.000304 0.000297