Introduction

Fiber optic shape sensing has proven to have great potential, especially in medical applications such as catheter navigation, surgical needle tracking, and flexible endoscope navigation. Compared to other common navigation technologies (e.g., optical trackers, electromagnetic sensors, or medical imaging), fiber shape sensing has many advantages, such as immunity to electromagnetic fields, bio-compatibility, and high flexibility. Fiber shape sensors are small in diameter, easily integrable into flexible instruments, and require no line-of-sight. Distributed sensors based on multicore fibers can also provide high-resolution shape measurements1,2.

Fiber shape sensors measure off-axis strain, which is then used to compute the directional curvature and reconstruct the sensor’s shape3. Various fiber sensor configurations have been investigated for off-axis strain measurement, including multicore fibers with4,5,6 or without7,8,9 fiber Bragg gratings (FBG) in their cores, fibers with cladding waveguide FBGs10, and fiber bundles made from multiple single-mode fibers that contain FBG arrays11,12,13,14,15. Accurate shape reconstruction necessitates high spatial resolution in off-axis strain measurement. With a distributed fiber shape sensor, sub-millimeter spatial resolution can be achieved1. However, these sensors require specialized and costly optical reflectometers to analyze the back-scattered light and retrieve strain variations16,17,18,19. Moreover, the signal-to-noise ratio of the back-scattering trace in such sensors depends on the spatial resolution and the level of applied strain. Quasi-distributed sensors, on the other hand, rely on more cost-effective readout units (e.g., FBG interrogators). However, their spatial resolution is limited by the low density of sensing planes4,20, making them unsuitable for tracking complex shape deformations. Therefore, there is a need for a cost-effective, high-resolution, and accurate fiber shape sensing technique.

Among cost-effective fiber shape sensors interrogated in the spectral domain, eccentric FBG (eFBG) sensors show great potential for tracking applications, thanks to their unique sensing mechanism21,22,23. Each sensing plane in eFBG shape sensors consists of three highly localized FBGs written off-axis in the fiber’s core (also known as an edge-FBG triplet), as shown in Fig. 1a21. Shape deformations are commonly computed from the displacement of the fundamental mode-field inside the optical fiber, estimated through spectral intensity modifications (see Fig. 1b, c)21,22. This approach is known as the mode-field displacement (MFD) method. However, several other effects, including bending-sensitive mode coupling24,25,26,27, polarization-dependent losses28,29,30,31,32, and wavelength-dependent bending losses33,34,35,36,37,38,39, also modify the spectral profile of eFBGs. These effects cannot be accurately modeled, and their impact on the sensor’s spectra is indistinguishable from that of the mode-field displacements. Further details on the eFBG configuration, sensing mechanism, and bending-induced effects are provided in “Methods”.

Fig. 1: Fiber Bragg grating (FBG) configuration and working principle of the eccentric FBG (eFBG) sensor.
figure 1

a Sketch of the cross-section view of the eFBG sensor. Each sensing plane of the eFBG sensor consists of three FBGs inscribed off-axis with ~90° angular separation (also known as an edge-FBG triplet). b Mode-field distribution of a straight single-mode fiber and the expected signal from the eFBGs within the same sensing plane. c When the fiber is curved, the mode-field distribution shifts in the direction opposite to the bend, which affects the relative intensities of the eFBGs.

In this paper, we introduce an end-to-end data-driven modeling technique based on deep learning (DL) that effectively identifies meaningful patterns in the eFBG signal, even in the presence of uncontrolled bending-induced effects. By incorporating these additional sources of information, our technique considerably improves the accuracy of shape prediction. Moreover, our approach enables high spatial resolution shape estimation directly from the eFBG sensor’s signal, eliminating the need for strain measurement, curvature computation, and shape reconstruction steps.

Results and discussion

Training and testing datasets

The eFBG fiber sensor used in this work is 30 cm long and consists of five sensing planes separated by 5 cm from each other. At each sensing plane, three off-axis FBGs are inscribed at a radial distance of approximately 2 μm from the top, left, and right sides of the fiber’s core. The dataset used for developing the DL-based model was collected using a setup similar to that reported in our previous work40 (see “Methods” for more detail). We used three consecutively measured, normalized spectral scans as input data to the proposed DL model. Each scan was recorded from 800 to 890 nm, comprising 190 wavelength components. The target data are the relative coordinates of 20 discrete points (reflective markers of the tracking system) measured over the length of the shape sensor (more detail on data preprocessing is available in ref. 41). This dataset consists of approximately 58,000 samples collected during 30 min of random movement of the fiber sensor. To evaluate the predictive performance of the trained model in an unbiased way, the samples were first shuffled and then split into Train-Validation-Test subsets, with 80% used for training, 10% for validation, and 10% for testing. In the remainder of this paper, we refer to this testing dataset as Test1. A separate set of data, denoted as Test2 and consisting of approximately 5800 samples, was recorded to evaluate the performance of the trained model on unseen shapes resulting from continuous movement. Additionally, we collected 320 samples, referred to as Test3, in which specific sensor regions were bent. Further details are provided in the Methods section.
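For concreteness, the shuffled 80/10/10 partition can be expressed in a few lines; the following is an illustrative sketch in which the placeholder arrays merely mimic the shapes described above and do not represent our actual pipeline:

```python
import numpy as np

rng = np.random.default_rng(seed=42)    # fixed seed for a reproducible split

# Placeholder data mimicking the described shapes: ~58,000 samples of three
# consecutive scans x 190 wavelengths, targets are 20 markers x (x, y, z).
spectra = rng.random((58000, 3, 190))
coords = rng.random((58000, 20, 3))

idx = rng.permutation(len(spectra))     # shuffle before splitting
n_train = int(0.8 * len(idx))
n_val = int(0.1 * len(idx))

train_idx = idx[:n_train]
val_idx = idx[n_train:n_train + n_val]
test1_idx = idx[n_train + n_val:]       # remaining 10% -> Test1

X_train, y_train = spectra[train_idx], coords[train_idx]
X_val, y_val = spectra[val_idx], coords[val_idx]
X_test1, y_test1 = spectra[test1_idx], coords[test1_idx]
```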

Neural network design

The DL model needs a specially designed network architecture to extract essential features from the sensor’s spectra and to accurately predict its corresponding shape. In this study, we employed an optimization algorithm inspired by the Hyperband optimizer42 to fine-tune the network’s hyperparameters. These hyperparameters, which cannot be directly determined from the training data, play a crucial role in model performance. Figure 2 illustrates the architecture of the best-performing configuration achieved after hyperparameter tuning (see “Methods” for further details).
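The core idea behind Hyperband-style tuning is successive halving: many randomly sampled configurations are trained with a small budget, and only the most promising fraction survives to receive a larger one. Below is a minimal sketch of this principle; the `sample_config` and `train_and_evaluate` functions are placeholders, not our actual implementation (the actual search space is given in Table 2 under “Methods”):

```python
import random

def sample_config():
    # Placeholder: draw one hyperparameter configuration from the search space.
    return {
        "n_conv": random.choice([3, 4, 5, 6]),
        "n_fc": random.choice([4, 5, 6]),
        "dropout": random.uniform(0.1, 0.5),
        "lr": 10 ** random.uniform(-5, -3),
    }

def train_and_evaluate(config, epochs):
    # Placeholder: train the model for `epochs` and return a validation loss.
    return random.random() / epochs     # dummy score for illustration only

def successive_halving(n_configs=27, min_epochs=1, eta=3):
    configs = [sample_config() for _ in range(n_configs)]
    epochs = min_epochs
    while len(configs) > 1:
        scored = [(train_and_evaluate(c, epochs), c) for c in configs]
        scored.sort(key=lambda s: s[0])                     # lower loss is better
        configs = [c for _, c in scored[: max(1, len(scored) // eta)]]
        epochs *= eta                                       # survivors get more budget
    return configs[0]

best = successive_halving()
```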

Fig. 2: Architecture of the best-performing configuration after hyperparameter tuning.
figure 2

The architecture includes five 1D convolutional layers (Conv1D), six fully connected layers, five max pooling layers, four batch normalization steps, and two dropout steps. The designed network receives three consecutive spectral scans as the input and predicts the relative coordinates of 20 discrete points over the sensor’s curve. More details on the channel, kernel, and pooling sizes are available under “Methods”. ReLU rectified linear unit.

Shape prediction evaluation

We evaluated the performance of the DL approach using the three testing datasets and compared it with the MFD method. It should be noted that the density of sensing planes in our eFBG shape sensor is insufficient for the MFD method to accurately estimate complex deformations. Nevertheless, we conducted this test to highlight the superiority of the proposed data-driven technique (the DL method).

Table 1 presents the shape error metrics: the tip error, defined as the Euclidean distance between the true and predicted coordinates of the sensor’s tip, and the root-mean-square error (RMSE), defined as the root mean square of the Euclidean distances between the true and predicted coordinates of the discrete points along the sensor’s length. On the Test1 dataset, the MFD approach yielded a median tip error of 111.3 mm with an interquartile range (IQR) of 121.5 mm. On the Test2 dataset, these values decreased to a median of 98.5 mm and an IQR of 46.0 mm. The performance difference can be attributed to the fact that the Test1 dataset contains more diverse shapes, as its samples are randomly selected from a larger dataset, whereas Test2 represents continuous sensor movement over a shorter period. As expected, the error values remain considerably high across all testing datasets, since the MFD approach has too little information available to accurately estimate complex shape deformations.
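Both metrics are straightforward to compute from the true and predicted marker coordinates; a minimal sketch, assuming coordinates are stored as (n_markers, 3) arrays in millimeters and that the last marker corresponds to the tip:

```python
import numpy as np

def tip_error(true_coords, pred_coords):
    """Euclidean distance between the true and predicted tip coordinates."""
    return np.linalg.norm(true_coords[-1] - pred_coords[-1])

def shape_rmse(true_coords, pred_coords):
    """RMS of the Euclidean distances over all discrete points on the sensor."""
    dists = np.linalg.norm(true_coords - pred_coords, axis=1)
    return np.sqrt(np.mean(dists ** 2))
```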

Table 1 Shape evaluation errors in mode-field displacement (MFD) and deep-learning (DL) methods using test sets Test1, Test2, and Test3.

The DL method, on the other hand, considerably improved the accuracy of shape prediction for Test1 samples, resulting in a median tip error of 2.1 mm with an IQR of 2.6 mm. These values increased to a median of 17.1 mm and an IQR of 12.6 mm on the less diverse Test2 samples. This is because the DL model can only learn to extract the most general and relevant features from the input signal when the training dataset adequately represents the expected sensor signals. However, in the case of the Test2 dataset, less than 2% of the samples have at least 100 similar examples in the training data. To measure similarity, we employed a maximum RMSE threshold of 5 mm after evaluating various thresholds. This indicates that the 30 min of manual shape manipulation is insufficient to cover the full working space of the sensor and to create a representative training dataset for the model to generalize effectively. In the Test1 dataset, by contrast, almost 20% of the samples have at least 100 similar examples in the training dataset. This means that the DL method is being tested on samples that the model has already learned to handle, simulating a situation where the training dataset represents the expected sensor shapes.
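This similarity analysis amounts to counting, for each test shape, the training shapes whose point-wise RMSE falls below the threshold; a brute-force sketch of the counting, illustrative rather than our exact implementation:

```python
import numpy as np

def count_similar(test_shape, train_shapes, threshold_mm=5.0):
    """Count training shapes within an RMSE threshold of one test shape.

    test_shape: (n_markers, 3); train_shapes: (n_train, n_markers, 3), in mm.
    """
    dists = np.linalg.norm(train_shapes - test_shape, axis=2)   # (n_train, n_markers)
    rmse = np.sqrt(np.mean(dists ** 2, axis=1))                 # (n_train,)
    return int(np.sum(rmse <= threshold_mm))

# Fraction of test samples with at least 100 similar training examples:
# frac = np.mean([count_similar(s, train_shapes) >= 100 for s in test_shapes])
```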

The shape evaluation results on the Test1 dataset thus define the lower bound on the error achievable with our model. This performance difference also suggests that the DL model is best trained in an application-specific manner, since it can focus more effectively on relevant features when it learns from the expected shape distribution of the sensor. On the other hand, when the training data covers a wide range of expected sensor behaviors, the DL model may simply “memorize” the corresponding shape for each signal without searching for relevant features in the sensor’s spectrum. To investigate this further, we compared the performance of our DL method with a dictionary-based algorithm. In this approach, a pre-defined dictionary was created using all training and validation samples. The shape prediction was then made by finding the closest spectrum to the test sample and returning its corresponding shape. This technique is equivalent to the k-nearest neighbors (kNN) algorithm with a k value of 1. The median tip errors for the Test1 and Test2 datasets using this dictionary-based algorithm are 5.9 and 50.0 mm, with IQR values of 3.9 and 43.3 mm, respectively. We also evaluated the kNN algorithm with k values of 3, 5, 7, and 9, which resulted in median tip errors of 6.4, 7.8, 9.1, and 10.1 mm for the Test1 dataset and 47.3, 46.1, 45.4, and 44.8 mm for the Test2 dataset, respectively. All of these errors are higher than those obtained with our DL technique, showing that our DL model generalizes well and provides more accurate shape predictions.
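The dictionary-based baseline is equivalent to nearest-neighbor regression on the flattened spectra; a minimal scikit-learn sketch with placeholder data (array names and sizes are illustrative):

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(0)
X_train = rng.random((1000, 3 * 190))   # flattened spectra (placeholder)
y_train = rng.random((1000, 20 * 3))    # flattened marker coordinates

# k=1 returns the shape of the single closest stored spectrum, i.e., a pure
# dictionary lookup; larger k averages the shapes of the k nearest spectra.
knn = KNeighborsRegressor(n_neighbors=1)
knn.fit(X_train, y_train)

X_test = rng.random((10, 3 * 190))
pred_shapes = knn.predict(X_test).reshape(-1, 20, 3)
```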

Two essential factors have to be considered when working with dictionaries: the size of the dictionary and the execution time required to find the best matching example. To obtain an accurate shape estimation for a given sample, the dictionary should contain a sufficiently large number of stored samples to cover all possible examples, which leads to a long execution time. Thus, there is a trade-off between accuracy and execution time when using this approach. However, extensive training data do not negatively affect the inference time in the DL method, as the resulting model size is independent of the training data size. This makes the DL method advantageous in terms of both accuracy and efficiency.

Our observations showed that the designed DL model can accurately recognize deformations even when they occur between the sensing planes. To further investigate this intriguing finding, we evaluated the shape predictions using the Test3 dataset, in which the deformations are exclusively applied between the sensing planes. The Test3 dataset comprises four deformation examples, each repeated twice and measured 40 times. As anticipated, the classical MFD method was not able to accurately predict the sensor’s shape for such deformations, as the deformed area did not coincide with any of the sensing planes. In contrast, the DL method achieved a median tip error of 6 mm, approximately six times smaller than the median tip error obtained with MFD on this dataset. The precision of the predicted tip position in the Test3 dataset averages 1.9 mm.

An example from the Test3 samples, in which the sensor experienced bending between sensing planes 3 and 4, is depicted in Fig. 3a. It is important to note that the intensity ratio of the eFBG Bragg peaks in each sensing plane can also be influenced by various factors apart from fundamental mode-field displacements, as previously mentioned. The MFD approach, however, does not consider such effects and is thus incapable of correctly interpreting the resulting signal variations. In contrast, the DL model managed to accurately predict the sensor’s shape by considering the full spectral profile, including the minute changes occurring at wavelengths outside the Bragg resonances. Figure 3b illustrates the finite difference of the loss value with respect to the 190 wavelength components of the input spectra. A higher difference indicates a greater importance of the corresponding wavelength component for shape prediction in this example; this difference thus quantifies the influence of each wavelength component of the input spectra and helps decode the model’s predictions (see “Methods” for detailed information). Figure 3c provides a deeper insight into this analysis: for all 190 wavelength components, the Euclidean distance between the predicted relative coordinates of each marker before and after the spectral modification is depicted through a color map, from which the contribution of each wavelength component to the relative coordinate prediction of all 20 markers can be discerned.

Fig. 3: Decoding the deep-learning (DL) model decision for deformations between sensing planes.
figure 3

a Example from the Test3 dataset demonstrating the bending of the sensor between the sensing planes 3 and 4. The true shape (ground truth) is indicated by green circles. The five sensing planes of the sensor are shown with × signs. The predicted shapes using the mode-field displacement method (MFD) and the DL method are shown with blue and orange solid lines, respectively. b Visualization of the finite difference of the loss value with respect to the input spectral elements. Wavelength components shown with colors closer to dark purple contribute more to the model’s decision in this particular example. c Highlighting the importance of input spectral elements in the relative coordinate prediction of all 20 markers based on the magnitude of the Euclidean distance between the predicted relative coordinates of each marker before and after spectral modification. Each row corresponds to one marker, and the color map represents the importance of the wavelength component. Wavelength components shown with colors closer to dark green have a greater impact on the model’s decision. The markers at the bent area are highlighted in the presented color map. SPi ith Sensing Plane.

Another important finding of our study is the DL model’s ability to detect deformations occurring after the last sensing plane. Figure 4a illustrates an example in which a 3 cm long segment, located 1 cm after the last sensing plane, was deformed. Similar to the example depicted in Fig. 3, the MFD method was unable to predict the sensor’s shape under such deformations. In contrast, the DL model employed relevant features in the side slopes of the eFBG spectra to predict the correct shape (see Fig. 4b, c). This intriguing performance can be attributed to the wavelength-dependent interference between the light back-reflected from the air-glass interface at the fiber’s end tip (Fresnel reflection) and the incident downstream light in the region after the last sensing plane. Deformations in this region affect the interference in two ways: first, the bending changes the spectral profile of the downstream light, and second, it alters the coupling conditions between the back-reflected and the downstream light. As a result, the measured spectra exhibit small variations that reflect the influence of deformations on the interference pattern. More examples of the sensor’s predicted shapes using the DL and the MFD methods on the Test1, Test2, and Test3 datasets are provided in Supplementary Movies 1–3, respectively.

Fig. 4: Decoding the deep-learning (DL) model decision for deformations after the last sensing plane.
figure 4

a Example from the Test3 dataset in which a 3 cm long segment, 1 cm after the last sensing plane, is deformed. b Visualization of the finite difference of the loss value with respect to the input spectral elements. Wavelength components shown with colors closer to dark purple contribute more to the model’s decision in this particular example. c Highlighting the importance of input spectral elements in the relative coordinate prediction of all 20 markers based on the magnitude of the Euclidean distance between the predicted relative coordinates of each marker before and after spectral modification. Each row corresponds to one marker, and the color map represents the importance of the wavelength component. Wavelength components shown with colors closer to dark green have a greater impact on the model’s decision. The markers at the bent area are highlighted in the presented color map. MFD mode-field displacement. SPi ith Sensing Plane.

Optimum number of sensing planes

A key factor in eFBG sensors when employing the MFD method is the number of sensing planes for detecting shape deformations. As with any other quasi-distributed shape sensor, the spacing between the sensing planes determines the sensor’s spatial resolution in shape measurements. When dealing with complex shape deformations, a limited number of sensing planes (resulting in low spatial resolution) can lead to large tip errors in methods that involve shape reconstruction (e.g., the MFD method). In this section, we present a theoretical analysis to determine the minimum number of sensing planes required in eFBG sensors when employing the MFD method to achieve the same level of shape prediction accuracy as attained by our DL method using five sensing planes.

In this theoretical analysis, we simulated the shape reconstruction error for different spatial resolutions. Our methodology involves interpolating the discrete curve points along the sensor’s true shape, measured by the motion capture system, using a spline with a resolution of 0.1 mm (this value was chosen empirically). Subsequently, we compute the curvature and torsion (the curve’s deviation from its osculating plane) at the query points. Using the computed curvatures and bending directions at the sensing planes, we reconstructed the spatial curve and compared it with the true shape.
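The curvature and torsion computation can be sketched with SciPy using the parametrization-invariant Frenet formulas, κ = |r′ × r″| / |r′|³ and τ = (r′ × r″) · r‴ / |r′ × r″|²; the smoothing and query settings below are illustrative, and the subsequent curve reconstruction step is omitted:

```python
import numpy as np
from scipy.interpolate import splprep, splev

def curvature_torsion(points, n_query=2500):
    """Curvature and torsion along a spline through discrete 3D curve points.

    points: (n_points, 3) marker coordinates in mm; n_query ~ 0.1 mm steps
    for a 25 cm curve. Curvature is returned in 1/mm for mm inputs.
    """
    tck, _ = splprep(points.T, s=0)              # interpolating cubic spline
    u = np.linspace(0.0, 1.0, n_query)
    d1 = np.array(splev(u, tck, der=1)).T        # r'(u)
    d2 = np.array(splev(u, tck, der=2)).T        # r''(u)
    d3 = np.array(splev(u, tck, der=3)).T        # r'''(u)
    cross = np.cross(d1, d2)
    cross_norm = np.linalg.norm(cross, axis=1)
    kappa = cross_norm / np.linalg.norm(d1, axis=1) ** 3
    # Small epsilon guards against division by zero on straight segments.
    tau = np.einsum("ij,ij->i", cross, d3) / (cross_norm ** 2 + 1e-12)
    return kappa, tau
```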

For a 25 cm long sensor with 50 mm spatial resolution (equivalent to five sensing planes), the median tip error of the reconstructed shapes, evaluated using the Test1 and Test2 datasets, is approximately 50 mm. This error is nearly 16 times higher than the performance achieved by the DL approach (see Table 1). To achieve a median tip error of 3 mm, a spatial resolution of approximately 3 mm would be necessary, implying that the MFD method would require approximately 84 sensing planes comprising 252 eFBGs.

Conclusion

In this paper, we developed a fiber shape sensing mechanism based on a data-driven technique, eliminating the need for off-axis strain measurement and curvature computation at discrete points along the fiber sensor to estimate its 3D shape. Our approach utilizes an easy-to-fabricate eFBG sensor combined with a simple and cost-effective readout unit. We designed an end-to-end DL algorithm that learns directly from the sensor’s signal to predict its corresponding shape. We extensively evaluated the shape prediction accuracy of our designed model (the DL method) under various testing conditions and compared it with the MFD method as a representative baseline. Our findings highlight that the spatial resolution of off-axis strain measurement is the main limitation of FBG-based (quasi-distributed) shape sensors, as deformations between the sensing planes go undetected in complex shapes. Our DL method compensates for this limitation by utilizing the full spectrum of the eFBG sensor, including the side slopes of the Bragg resonances, to predict complex shape deformations.

We believe that the DL model exploits the impact of bending-induced phenomena, including cladding mode coupling, bending-loss oscillations, and polarization-dependent losses, as additional sources of information to overcome the spatial resolution limitation in detecting complex deformations. As a result, there is no need to modify the fiber sensor design or its interrogation system to mitigate the impact of these bending-induced phenomena. Our DL method considerably reduces the shape prediction error for 3D curves within a curvature range of 0.58–33.5 m−1, achieving a reduction factor of approximately 50 compared to the MFD method. Moreover, we demonstrated that the designed DL model generalizes well, as its performance surpasses that of a dictionary-based algorithm by a factor of two. Importantly, our proposed shape sensing solution offers a cost-effective alternative, being 30 times less expensive than commercially available distributed fiber shape sensors while maintaining a similar level of accuracy.

In summary, our research presents a promising approach to fiber shape sensing by combining an easy-to-fabricate eFBG sensor, a data-driven DL model, and the exploitation of bending-induced phenomena. We believe that this work has the potential to drive advancements in efficient and cost-effective shape sensing across various applications.

Methods

Working principle of eFBG sensor

When the eFBG sensor undergoes bending, the field distribution of the fundamental mode shifts away from the center of the fiber core21,22,23 (see Fig. 1b). Displacements of the mode-field’s centroid lead to intensity changes in the reflected signal from the eFBGs21. From the intensity ratio between the eFBGs at each sensing plane, the directional curvature is computed and interpolated at small arc elements to reconstruct the 3D shape of the sensor21. For the sake of simplicity, this approach assumes that no other physical phenomena occurring inside a bent optical fiber influence the intensity ratio between the eFBGs within the same sensing plane.
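As an illustration of this principle, the bend vector at one sensing plane can be estimated from the three eFBG intensities under a linearized response model; the grating angles, the reference-based normalization, and the calibration constant below are assumptions for illustration only, not our sensor’s calibration:

```python
import numpy as np

# Assumed angular positions of the three off-axis gratings (top, left, right);
# the true angles and calibration constant are sensor-specific.
ANGLES = np.deg2rad([90.0, 180.0, 0.0])
K_CAL = 1.0   # curvature per unit intensity imbalance (assumed)

def directional_curvature(intensities, ref_intensities):
    """Estimate curvature magnitude and bend direction at one sensing plane.

    Linearized model: the change in normalized eFBG intensities relative to
    the straight fiber approximates the mode-field centroid shift, which
    points opposite to the bend direction and scales with curvature.
    """
    w = np.asarray(intensities, float)
    w0 = np.asarray(ref_intensities, float)
    dw = w / w.sum() - w0 / w0.sum()          # deviation from the straight fiber
    shift = np.array([np.sum(dw * np.cos(ANGLES)),
                      np.sum(dw * np.sin(ANGLES))])
    curvature = K_CAL * np.linalg.norm(shift)
    bend_direction = np.arctan2(-shift[1], -shift[0])   # opposite to the shift
    return curvature, bend_direction
```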

However, positioning FBGs away from the core axis breaks the cylindrical symmetry of the fiber, which increases coupling from the core mode to the cladding modes24,25. The strength of this mode coupling varies when the fiber is bent, as bending affects the overlap integral between the interacting modes24,26. Bending an optical fiber causes strain-induced refractive index changes and displaces the intensity distribution of the propagating light22,43, which directly influences the coupling efficiency. Therefore, the intensity of the cladding modes changes when the fiber is bent. In eFBGs, the formation of cladding-mode resonances enables a highly sensitive, full-directional bending response through simple light intensity measurements27. Although cladding modes are typically stronger in stripped fibers or fibers with coatings of lower refractive index than the cladding layer24,25, they have also been observed in standard fibers coated with higher refractive index materials44. Any recoupling between the excited cladding resonances and the fundamental mode affects the relative intensity values between the eFBGs.

FBG interrogators used for quasi-distributed sensors typically consist of a broadband light source (e.g., a superluminescent diode (SLED)) and a grating-based spectrometer. The emitted light from SLEDs is partially polarized, meaning that it undergoes wavelength-dependent polarization changes28 when propagating through a birefringent medium, such as a bent fiber29,30,31,32. Additionally, the efficiency of the spectrometer grating is sensitive to polarization, leading to polarization-dependent losses that affect the spectral profile. Consequently, the measured intensity ratio between the Bragg peaks is modified. The impact of polarization in intensity-based fiber sensors is often mitigated by using a polarization scrambler to randomize the polarization state or by employing polarization-insensitive spectroscopy instruments.

It is well known that light power loss increases when optical fibers bend33,45. This bending loss is typically observed as spectral modulations caused by coherent coupling between the core mode and the radiated field reflected by the cladding-coating and the coating-air interfaces (commonly referred to as whispering gallery modes)34,46. The reflected field at the coating-air boundary causes short-period modulations due to the longer re-injection path34,46, while reflections at the closer cladding-coating interface cause long-period resonances35,36,37,46. It is important to note that these bending-induced attenuation losses are also influenced by temperature variations. Temperature changes affect the refractive index of the coating layer, thereby influencing the coupling between the core and the cladding whispering gallery modes38. Several models have been proposed to evaluate the peak positions and shapes of bending losses35,36,39. The strong wavelength dependence of bending losses further complicates the design of intensity-based sensors46 as it modulates the spectral profile and affects the intensity ratio at the Bragg peaks of the eFBGs within the same sensing plane.

Setup

The data acquisition setup used for developing the DL-based model is depicted in Fig. 5. We used a cost-effective FBG interrogator (MIOPAS GmbH, Goslar, Germany) consisting of an uncooled transmit optical sub-assembly (TOSA) SLED module and a near-infrared (NIR) micro-spectrometer with a resolution of 0.5 nm. This setup allowed us to capture the spectra of the sensor across all 15 Bragg wavelengths, ranging from 813 to 869 nm. We recorded the sensor’s spectra at random curvatures and orientations (within a curvature range of 0.58–33.5 m−1) while monitoring the reflective markers attached to the 30 cm long sensor using a motion capture system (Oqus 7+, Qualisys AB, Sweden). The data acquisition duration was 30 min for the Test1 dataset and 3 min for the Test2 dataset. The acquisition rates of the FBG interrogator and the motion capture system were 75 and 200 Hz, respectively. The sensor’s spectra and the coordinate values corresponding to its shape were synchronized with a tolerance of less than 3 ms.
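The synchronization of the two asynchronous streams can be sketched as nearest-timestamp matching with a 3 ms tolerance; variable names and the matching strategy here are illustrative:

```python
import numpy as np

def match_streams(t_spec, t_mocap, tol=0.003):
    """Pair each spectral scan with the nearest motion-capture frame.

    t_spec: sorted spectrometer timestamps in seconds (~75 Hz);
    t_mocap: sorted motion-capture timestamps in seconds (~200 Hz).
    Returns indices of matched scans and frames within `tol` seconds.
    """
    j = np.searchsorted(t_mocap, t_spec)                  # insertion positions
    j = np.clip(j, 1, len(t_mocap) - 1)
    # Pick the closer of the two neighboring mocap frames.
    left_closer = (t_spec - t_mocap[j - 1]) < (t_mocap[j] - t_spec)
    j = np.where(left_closer, j - 1, j)
    ok = np.abs(t_mocap[j] - t_spec) <= tol
    return np.nonzero(ok)[0], j[ok]
```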

Fig. 5: Experimental setup for data acquisition.
figure 5

The motion capture system consisted of five tracking cameras (Oqus 7+, Qualisys AB, Sweden). For protection purposes, the fiber sensor was inserted in a Hytrel furcation tubing with an inner diameter of 425 μm and an outer diameter of 900 μm. Two v-clamps were used to hold the protection tubing securely and to fix the optical fiber in place before the insertion. Reflective markers with a diameter of 6.4 mm and an opening of 1 mm (X12Co., Ltd., Bulgaria) were affixed to the sensor. Additionally, a thermocouple was positioned near the sensor’s base to monitor the temperature throughout the data acquisition process, ensuring that any sudden thermal fluctuations did not impact the sensor’s signal.

In addition, we used a laser-cut curvature template (Fig. 5) to collect 320 samples for the Test3 dataset, in which specific regions of the sensor were intentionally bent. The curvature template features four grooves, enabling us to bend, with a bending radius of 50 mm, a 30 mm long segment midway between sensing planes 2 and 3, 3 and 4, and 4 and 5, as well as a segment 10 mm after the last sensing plane.

Training setup

The search space we defined for tuning the network’s hyperparameters consists of the number of 1D convolutional layers (Conv1D), the number of fully connected layers (FC), the layer settings, the choice of batch normalization (BN) and downsampling, training settings, and loss function parameters. The search criteria are outlined in Table 2.

Table 2 The search criteria for hyperparameter optimization.

In the designed network (Fig. 2), input samples with a batch size of 256 are first batch normalized and then fed into a Conv1D layer with 16 channels, followed by a max pooling layer with a kernel size of 3 and a stride of 2. The second Conv1D layer also has 16 channels, followed by a max pooling layer with a kernel size of 2. The third Conv1D layer has 32 channels, followed by a max pooling layer with a kernel size of 3 and a stride of 2. The fourth Conv1D layer also has 32 channels with a stride of 2, followed by a max pooling layer with a kernel size of 3. The last Conv1D layer has 256 channels, followed by batch normalization and a max pooling layer with a kernel size of 2 and a stride of 2. The extracted features are flattened into a 2048-long vector and fed into five FC layers, each with 2000 units. The first FC layer is followed by batch normalization, a dropout layer with a probability of 0.37, and two more FC layers. A batch normalization, an FC layer, a dropout layer with a probability of 0.16, and a fifth FC layer are the remaining layers before the final layer. The final FC layer maps the output of the fifth FC layer to the target values, the relative coordinates. In all layers of this network architecture, the rectified linear unit (ReLU) serves as the activation function, and the kernel size of the Conv1D layers is 3. The model is trained with the Adam optimizer at a learning rate of 0.0001, minimizing the SmoothL1 loss function with a threshold of 4.04.
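A PyTorch sketch approximating this architecture is given below; the padding choices are assumptions, and a lazily initialized first FC layer stands in for the reported 2048-long flattened vector so that the sketch runs regardless of the exact pooling arithmetic:

```python
import torch
import torch.nn as nn

class EFBGShapeNet(nn.Module):
    """Approximation of the described Conv1D/FC architecture (illustrative)."""

    def __init__(self, n_markers=20):
        super().__init__()
        self.features = nn.Sequential(
            nn.BatchNorm1d(3),                        # input: 3 scans x 190 wavelengths
            nn.Conv1d(3, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool1d(3, stride=2),
            nn.Conv1d(16, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(16, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool1d(3, stride=2),
            nn.Conv1d(32, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.MaxPool1d(3),
            nn.Conv1d(32, 256, 3, padding=1), nn.ReLU(),
            nn.BatchNorm1d(256),
            nn.MaxPool1d(2, stride=2),
            nn.Flatten(),
        )
        self.head = nn.Sequential(
            nn.LazyLinear(2000), nn.ReLU(),           # FC1 (input size inferred)
            nn.BatchNorm1d(2000),
            nn.Dropout(0.37),
            nn.Linear(2000, 2000), nn.ReLU(),         # FC2
            nn.Linear(2000, 2000), nn.ReLU(),         # FC3
            nn.BatchNorm1d(2000),
            nn.Linear(2000, 2000), nn.ReLU(),         # FC4
            nn.Dropout(0.16),
            nn.Linear(2000, 2000), nn.ReLU(),         # FC5
            nn.Linear(2000, n_markers * 3),           # relative (x, y, z) per marker
        )

    def forward(self, x):
        return self.head(self.features(x))

model = EFBGShapeNet()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
loss_fn = nn.SmoothL1Loss(beta=4.04)
out = model(torch.randn(256, 3, 190))                 # batch of 256 spectral triplets
```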

Decoding the model’s decisions

Inspired by the concept of Gradient-weighted Class Activation Mapping (Grad-CAM), we decoded the decisions made by our convolutional neural network (CNN)-based model. By decoding the model’s decisions, we gained insights into which parts of the input spectra contribute to the coordinate predictions. Grad-CAM is a widely used technique in image classification tasks that generates visual explanations from any CNN-based model without requiring re-training or architectural modifications. The gradient measures how changes in the input affect the output, indicating which parts of the input have the greatest impact on the model’s prediction.

However, the gradient heat map produced by the last Conv1D layer has limited resolution due to the small output dimension in each channel. Therefore, instead of the gradient of the Conv1D layers, we computed the forward finite difference of the model’s loss with respect to the input spectral elements. The spacing constant was chosen to be 0.1, higher than the spectral intensity noise level. In this method, we modified the intensity value of one spectral element and observed the resulting changes in the model’s loss value. We repeated this process for all 190 spectral elements. The resulting color maps are illustrated in Figs. 3b and 4b, representing the impact of the changes in each spectral element on the model’s SmoothL1 loss value. To analyze the contribution of each spectral element to the coordinate prediction of individual markers, we computed the Euclidean distance between the predicted coordinates of each marker before and after spectral modification. This allowed us to identify the spectral elements contributing to the relative coordinate prediction of each marker. By highlighting these spectral elements, we gained a better understanding of the factors influencing the model’s predictions.
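The finite-difference procedure can be sketched as follows; the model and loss are as in the earlier architecture sketch, and whether each perturbation is applied to one scan or to all three consecutive scans at once is our assumption:

```python
import torch

def spectral_saliency(model, x, y, loss_fn, h=0.1):
    """Forward finite difference of the loss w.r.t. the 190 wavelength components.

    x: one input sample, shape (1, 3, 190); y: its target, shape (1, 60).
    """
    model.eval()
    with torch.no_grad():
        base = loss_fn(model(x), y).item()
        diffs = torch.zeros(x.shape[2])
        for i in range(x.shape[2]):
            xp = x.clone()
            xp[:, :, i] += h                  # perturb one wavelength component by h
            diffs[i] = loss_fn(model(xp), y).item() - base
    return diffs                              # basis for maps like Figs. 3b and 4b

def marker_influence(model, x, h=0.1, n_markers=20):
    """Euclidean shift of each marker's predicted coordinates per perturbation."""
    model.eval()
    with torch.no_grad():
        base = model(x).reshape(n_markers, 3)
        out = torch.zeros(x.shape[2], n_markers)
        for i in range(x.shape[2]):
            xp = x.clone()
            xp[:, :, i] += h
            pred = model(xp).reshape(n_markers, 3)
            out[i] = torch.linalg.norm(pred - base, dim=1)
    return out                                # basis for maps like Figs. 3c and 4c
```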