18 - 22 August 2024
San Diego, California, US
Post-deadline submissions will be considered for the poster session, or oral session if space becomes available

This conference is intended to provide a forum for interchange on various algorithms, systems, sensors, and architectures for novel applications in optics and photonics in information processing. Original unpublished contributions reporting recent advances in analog and hybrid optical information systems and techniques are solicited. In addition, papers related to curriculum development, review papers, and teaching in a cross-disciplinary setting in any of the following topics are welcome. All abstracts will be reviewed by the program committee for originality and merit. Topics of interest include, but are not limited to, the following:

Algorithms
New Architecture and Systems
Optical Switching and Interconnects
Digital Optical Processing and Architecture
Wavefront-based Computation
Applications in Biophotonics
Image Forming and Processing Applications
Optical Information Processing Around the Globe
Optics and Photonics with a focus on sustainability ;
In progress – view active session
Conference 13136

Optics and Photonics for Information Processing XVIII

21 August 2024
View Session ∨
  • Poster Session
  • Signal, Image, and Data Processing Keynote
  • Optical Engineering Plenary
  • 1: Artificial Intelligence and Algorithms
  • 2: Photonics Systems and Devices for Information Processing
  • 3: Imaging Systems and Autonomous Navigation
  • 4: Algorithms and Imaging
  • Featured Nobel Plenary
Information

Want to participate in this program?
Post-deadline abstract submissions accepted through 20 June. See "Additional Information" tab for instructions.

Poster Session
19 August 2024 • 5:30 PM - 7:00 PM PDT
Conference attendees are invited to attend the poster session on Monday evening. Come view the posters, enjoy light refreshments, ask questions, and network with colleagues in your field. Authors of poster papers will be present to answer questions concerning their papers. Attendees are required to wear their conference registration badges to the poster sessions.

Poster Setup: Monday 10:00 AM - 4:30 PM
Poster authors, view poster presentation guidelines and set-up instructions at https://spie.org/OP/poster-presentation-guidelines
13136-22
Author(s): Adriana R. Sánchez, Jorge Francés, Adrián Moya, Emilio J. Mena, Joan Josep Sirvent-Verdú, Eva M. Calzado Estepa, Sergi Gallego, Inmaculada Pascual, Andrés Márquez Ruiz, Univ. de Alicante (Spain)
19 August 2024 • 5:30 PM - 7:00 PM PDT
Show Abstract + Hide Abstract
SLMs use PA-LCoS microdisplays for phase-only modulation, known for high resolution and small pixel size. However, phenomena like cross-talk and out-of-plane reorientation of the liquid crystal director degrade pixelated SLM device performance. As novel applications require higher resolutions and smaller pixel sizes, a numerical workflow is applied to analyse different parameters of the microdisplay, such as fill factor, pixel size, external voltage, LC director influence, and light interaction. Specifically, the analysis focuses on the impact of high-frequency binary phase gratings on radiometric and polarimetric response, setting up a framework for future microdisplays with higher resolutions and smaller pixel sizes.
13136-23
Author(s): Saeed Bohlooli Darian, Univ. of Ulsan (Korea, Republic of); Jeongmin Oh, Asan Medical Ctr. (Korea, Republic of); Jun Ki Kim, Univ. of Ulsan (Korea, Republic of), Asan Medical Ctr. (Korea, Republic of)
19 August 2024 • 5:30 PM - 7:00 PM PDT
Show Abstract + Hide Abstract
Traditional microscopes face resolution constraints, and advanced techniques pose difficulties in studying multiple cells simultaneously. Additionally, exploring the dynamics of living cells within their natural organismic environment often demands sophisticated and costly equipment, which may not be affordable for every laboratory. Our approach utilizes computational techniques, such as super-resolution radial fluctuations, on sequences of tissue images. Through various adjustments, we have enhanced resolution, reduced background noise, and amplified signals for subcellular structures. Despite starting with unclear images, our refined methods ensure clarity in depicting organelle structures and dynamics in the final outcomes. These tools are now accessible for research across diverse settings.
13136-24
Author(s): Juan Jose Alcalde-Castro, Alejandro Restrepo-Martínez, Giovanni Restrepo-Betancur, Univ. Nacional de Colombia Sede Medellín (Colombia)
19 August 2024 • 5:30 PM - 7:00 PM PDT
Show Abstract + Hide Abstract
A microscope design was developed using 3D printing and liquid crystals to achieve variable polarization configurations and semi-automatic processes without the need for mechanical movement of polarizers and retarders. The device includes a polariscope composed of electronically controlled liquid crystals and a compliant mechanism for precise micrometric movements. Stepper motors and software control these movements, and the process of acquiring images requires connecting a sensor to a Raspberry Pi. The microscope's lens and extension tube assembly magnify the image 600 times. The study compares two polarization techniques: Stokes and ellipsometric polarization. Additionally, it analyzes pyroxylin crystals and meiotic spindles of maturing porcine oocytes. The ellipsometric technique is more effective in detecting low retards, as indicated by the results. This prototype has the potential to reduce the cost of in vitro fertilization in assisted reproduction laboratories, with a focus on animals of high genetic value.
13136-25
Author(s): Andrés García, Zaid Moroyoqui, Brandon Martínez, Ulises Orozco-Rosas, Kenia Picos, CETYS Univ. (Mexico)
19 August 2024 • 5:30 PM - 7:00 PM PDT
Show Abstract + Hide Abstract
This paper presents the development of a lidar-based object classification system with the use of machine learning and signal processing. The proposal employs Support Vector Machines (SVM) to classify objects and terrain with the help of a self-made lidar which scans an area in the same way as a picture is taken. This project involves the processing of data to generate a point cloud that lets us visualize the scans taken by the lidar. Additionally two datasets were built by taking multiple scans of different terrains and objects with a cylindrical shape. This paper shows experimental results of machine learning models built around lidar acquired data and small datasets, it also shows point cloud visualizations and signal processing techniques to filter data.
13136-26
Author(s): Mario Angel Rico-Mendez, Romeo Selvas, Manuel Garcia, Univ. Autónoma de Nuevo León (Mexico); J. Manuel Sierra-Hernández, Univ. de Guanajuato (Mexico); Ricardo Chapa, Leonardo Arévalo Bautista, Norma Patricia Puente, Sheila Bazavilvazo, Univ. Autónoma de Nuevo León (Mexico); Eloisa Gallegos-Arellano, Univ. Tecnológica de Salamanca (Mexico)
19 August 2024 • 5:30 PM - 7:00 PM PDT
Show Abstract + Hide Abstract
This work presents a novel fiber optic Mach-Zehnder interferometer (MZI) sensor which can measure curvature due to its structure. This structure employs a 2-section MZI filter, constructed from SMF-28 fiber using the core-offset technique. The source in the 1480-1600 nm range shows an intermodal energy interference between the core and cladding and exhibits six spacing notches for sensing applications. The sensibility obtained was of 0.00018 nm/uW when curvature is applied. On the other hand, by including a thin film based on Zinc Oxide (ZnO) and another with Aluminum doped zinc oxide (AZO) in the sensing arrangement, an increase in the sensitivity was detected with a value of 0.00015 nm/uW and 0.00016 nm/uW, respectively. This sensor can be used for applications in various fields, such as environmental monitoring, engineering, and process control.
13136-28
Author(s): Andrés Márquez Ruiz, José Carlos Vázquez, Juan Carlos Bravo, Jorge Francés, Cristian Neipp, Sergi Gallego, Manuel Ortuño, Manuel Gutiérrez, Inmaculada Pascual, Augusto Beléndez, Univ. de Alicante (Spain)
19 August 2024 • 5:30 PM - 7:00 PM PDT
Show Abstract + Hide Abstract
In the last years many efforts have been invested in the development of augmented reality devices. Depending on the application different constraints need to be faced. Providing full color see-through augmented reality on eyeglasses with customized ophthalmic correction, and valid for a wide range of use cases, is one of the most challenging and ambitious applications. We are within a European Project aiming to this goal. One of the key components in this eyewear is the holographic lens mirror (HLM), acting as the beam combiner responsible for the see-through capability. Preliminary results will be given in this work.
13136-29
Author(s): Yuanyuan Ding, Shanghai Astronomical Observatory (China)
19 August 2024 • 5:30 PM - 7:00 PM PDT
Show Abstract + Hide Abstract
Aperture masking observations for binary stars have been done with a 1.56-m telescope at Shanghai Astronomical Observatory during 2020-2024. In order to ensure ample light is available to restore the high resolution images even at short exposure times and reach the diffraction limit of the 1.56m telescope,we select some binary stars and reference stars nearby with magnitude 5-7 and angular distance 0.2-2arcsec in the WDS catalog as observation targets.This article achieves high-resolution restoration of binaries by a hybrid data processing method, including data reduction, a spatial domain method named ISA to suppress atmospheric turbulence, and OSEM to reduce image degradation caused by multi-aperture interference.The results show that this method can effectively obtain high-resolution images of binary stars, and the measured angular distance is basically consistent with the given value in WDS catalog.
Signal, Image, and Data Processing Keynote
20 August 2024 • 2:30 PM - 3:15 PM PDT
Session Chair: Khan Iftekharuddin, Old Dominion Univ. (United States)

2:30 PM - 2:35 PM:
Welcome and Opening Remarks
13136-501
Author(s): Zhi-Pei Liang, Univ. of Illinois (United States)
20 August 2024 • 2:35 PM - 3:15 PM PDT
Show Abstract + Hide Abstract
The ongoing paradigm shift in healthcare towards personalized and precision medicine is posing a critical need for noninvasive imaging technology that can provide quantitative tissue and molecular information. Magnetic resonance signals from biological systems contain information from multiple molecules and multiple physical/biological processes (e.g., T1 relaxation, T2 relation, diffusion, perfusion, etc.). So, magnetic resonance imaging (MRI) is inherently a high-dimensional imaging technology that can acquire structural, functional and molecular information simultaneously. In practice, due to the curse of dimensionality, MRI experiments are often done in a low-dimensional setting to acquire biomarkers one at a time. Such a “divide-and-conquer” approach not only reduces data acquisition efficiency but also makes it difficult to obtain molecular information in high resolution. By synergistically integrating machine learning with sparse sampling, constrained image reconstruction and quantum simulation, we have successfully demonstrated ultrafast high-dimensional imaging of the brain. This talk will give an overview of this unprecedented omni imaging technology and show some exciting experimental results of brain function and diseases.
Optical Engineering Plenary
20 August 2024 • 3:30 PM - 5:35 PM PDT
3:30 PM - 3:35 PM:
Welcome and Opening Remarks
13138-501
Author(s): Manuel Gonzalez-Rivero, Maxar Technologies (United States)
20 August 2024 • 3:35 PM - 4:15 PM PDT
Show Abstract + Hide Abstract
With 140+ petabytes of historical data holdings, 3.8 million square kilometers of daily multi-spectral collection, integration of Synthetic Aperture Radar and newly launching assets every quarter, the opportunities to develop insight from sense making technologies at Maxar are ever growing. During this discussion, we will cover the challenges of collecting, organizing, and exploiting multi source electro-optical remote sensing systems at scale using modern machine learning architectures and techniques to derive actionable insights.
13131-501
Author(s): Nelson E. Claytor, Fresnel Technologies Inc. (United States)
20 August 2024 • 4:15 PM - 4:55 PM PDT
13145-501
Author(s): Jeremy S. Perkins, NASA Goddard Space Flight Ctr. (United States)
20 August 2024 • 4:55 PM - 5:35 PM PDT
Session 1: Artificial Intelligence and Algorithms
21 August 2024 • 8:00 AM - 10:00 AM PDT
Session Chair: Victor Hugo Diaz-Ramirez, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico)
13136-1
Author(s): Kwame Ampofo, Megan A. Witherow, Alex Glandon, Monibor Rahman, Ahmed Temtam, Mecit Cetin, Khan M. Iftekharuddin, Old Dominion Univ. (United States)
21 August 2024 • 8:00 AM - 8:20 AM PDT
Show Abstract + Hide Abstract
.This study introduces innovative deep learning techniques for automated flood extent and depth estimation on roadways, addressing the challenges posed by recurrent nuisance flooding. Leveraging advancements in computing and ubiquitous sensing, the proposed methods offer cost-effective solutions compared to traditional monitoring approaches. To overcome the scarcity of annotated data, a novel pipeline is devised to automatically generate training data using car image datasets, simulating various floodwater levels. The developed model achieves impressive performance metrics, including an F1 score of 0.9 for flood extent estimation and 74.85% accuracy, with an F1 score of 0.74 for four water levels, when tested with real-world flooding images. This approach substantially reduces the time and labor associated with dataset labeling, presenting a promising avenue for image-based flood depth estimation.
13136-2
Author(s): Karen Ortiz-Ruiz, Mireya S. García-Vázquez, Instituto Politécnico Nacional (Mexico); Alejandro A. Ramírez-Acosta, MIRAL R&D&I Multimedia (United States)
21 August 2024 • 8:20 AM - 8:40 AM PDT
Show Abstract + Hide Abstract
Individual identification of sea turtles relies on manual and semi-automated methods, requiring high-quality images and analyzing the geometric morphology of facial and dorsal scales. Automatic semantic segmentation accurately delineates areas in images, enabling object detection and identification. We propose an AI model for sea turtle identification, integrating the Segment Anything Model for semantic segmentation. This approach, based on the geometric morphology of facial scales, shell, fins, and neck, achieves precise identification leveraging unique anatomical features. High precision results are supported by evaluation metrics, providing a quantitative measure of the model's accuracy.
13136-3
Author(s): Aleksandr Duplinskii, Kaden Bearne, Matthew Fillipovich, Alex Lvovsky, Univ. of Oxford (United Kingdom)
21 August 2024 • 8:40 AM - 9:00 AM PDT
Show Abstract + Hide Abstract
Spatial optical mode sorter is a promising device that can significantly contribute to communication and imaging domains. However, currently available commercial devices fall short in effectively sorting large mode bases with high fidelity, particularly in the visible range. Traditionally, such sorting has relied on multi-plane light conversion (MPLC) setups, wherein the optical field undergoes iterative bouncing between a mirror and a spatial light modulator programmed with different phase profiles. This configuration can be also thought of as a diffractive optical neural network (DONN), where each phase mask acts like a single layer. In this work we treat the mode sorter as a DONN and instead of employing iterative algorithms to calculate optimal phase masks with conventional MPLC approaches, we propose training it with machine learning algorithms. We investigate various training methodologies, including backpropagation within simulation environments, hybrid approach that incorporate experimental data, and forward-forward training strategies.
13136-4
Author(s): Luis Hector Sanchez, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico); Carlos Alberto Guerrero Peña, Facultad de Ciencias Físico-Matemáticas (Mexico); Juan Jose Tapia Armenta, Ana Victoria Ojeda, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico)
21 August 2024 • 9:00 AM - 9:20 AM PDT
Show Abstract + Hide Abstract
This work introduces a methodology for characterizing stellar objects using the photometric data from the J-PLUS mission's third data release, which does not include direct astrophysical parameters such as effective temperature, stellar radius, and luminosity. By cross-matching this data with the GAIA space mission's third release, we were able to enhance the J-PLUS catalog with these essential parameters. We then employed supervised machine learning techniques, specifically the XGBoost and Random Forest algorithms, to estimate the effective temperature, stellar radius, and luminosity for stars not catalogued by J-PLUS. This method provides a significant improvement in the characterization of the 8,013,455 stars observed in the J-PLUS mission and illustrates the potential of combining different astronomical data sources with advanced analytical techniques for stellar classification.
13136-5
Author(s): Eyitomilayo Y. Babatope, Mireya S. García-Vázquez, Instituto Politécnico Nacional (Mexico); Alejandro A. Ramírez-Acosta, MIRAL R&D&I Multimedia (United States)
21 August 2024 • 9:20 AM - 9:40 AM PDT
Show Abstract + Hide Abstract
Hands appear frequently in the field of view in egocentric vision. Hand segmentation is an important and challenging computer vision task with applications in human-computer interaction and assistive technologies. Optimization techniques were introduced to the state-of-the-art model, RefineNet to improve performance. The improved deep convolutional neural network (RefinePix-Net) was trained using egohands dataset. This model performed better when tested with publicly available hands datasets outperforming the baseline model across two metrics mean intersection over union (mIoU) and mean precision (mPrec). This improved model can be applied in object and activity recognition and robotics.
13136-6
Author(s): Jesús Fernando Franco López, Benemérita Univ. Autónoma de Puebla (Mexico); Mireya S. García-Vázquez, Instituto Politécnico Nacional (Mexico); Alejandro Á. Ramírez-Acosta, MIRAL R&D&I Multimedia (United States)
21 August 2024 • 9:40 AM - 10:00 AM PDT
Show Abstract + Hide Abstract
Using Graph Convolutional Networks (GCNs) to classify and identify segmented images into superpixels is a growing field of study. These algorithms represent the content of images by creating a graph where superpixels act as nodes. Monitoring with GCNs provides valuable insights into the geographic distribution, population density, migration, survival, reproduction, and growth of sea turtles. The selection of GCN is based on the unique characteristics of sea turtle morphology and patterns. This algorithm offers high precision and accuracy, achieved through image segmentation and leveraging turtle morphology. Additionally, it offers a significant improvement in sea turtle identification compared to other algorithms, such as Hotspotter. Our new algorithm serves as a versatile tool to aid in monitoring and protecting these species, contributing to our understanding of marine life.
Break
Coffee Break 10:00 AM - 10:15 AM
Session 2: Photonics Systems and Devices for Information Processing
21 August 2024 • 10:15 AM - 11:35 AM PDT
Session Chair: Khan M. Iftekharuddin, Old Dominion Univ. (United States)
13136-7
Author(s): Kevin Zelaya, Queens College (United States); Mohammad-Ali Miri, The Graduate Ctr., CUNY (United States)
21 August 2024 • 10:15 AM - 10:35 AM PDT
Show Abstract + Hide Abstract
This talk introduces the design and analysis of a passive and fully optical on-chip direction-finding architecture, which can operate effectively in broad or narrow detection scenarios. The proposed system is based on an equally spaced linear array of M grating couplers that steer incoming waves toward a passive and non-unitary photonic processor. This performs a processing operation from which the incident angles of the incoming waves are discretely determined at the N outputs of the photonic processor. The detection functionalities are enhanced by introducing some tracking functions implemented in a post-processing stage at the photonic unit output. The benefits and disadvantages of each tracking function are illustrated. Lastly, an approximation to reduce the photonic unit to a unitary one is discussed, rendering a compact device that operates only on a narrow detection range.
13136-8
Author(s): Sofia Esquivel-Hernandez, Rigoberto Juarez-Salazar, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico); Amalia Martínez-García, Juan Antonio Rayas-Álvarez, Centro de Investigaciones en Óptica, A.C. (Mexico); Victor H. Diaz-Ramirez, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico)
21 August 2024 • 10:35 AM - 10:55 AM PDT
Show Abstract + Hide Abstract
A multiple view fringe projection profilometer can be constructed using several camera-projector pairs at different viewpoints. Alternatively, the profilometer can be constructed using flat mirrors to create virtual instances of a single camera-projector pair. In this work, the advantages of using several camera-projector pairs and mirrors are analyzed. A profilometer with three camera-projector pairs is compared with an equivalent profilometer composed of a single camera-projector pair and two mirrors. The performance of both profilometers is analyzed experimentally by performing full object reconstruction, considering system calibration simplicity and object reconstruction accuracy. This work provides practical insights for designing profilometers specialized in full object reconstruction.
13136-9
Author(s): Adriana R. Sánchez, Jorge Francés, Adrián Moya, Emilio J. Mena, Juan Carlos Bravo, Eva M. Calzado Estepa, Sergi Gallego, Augusto Beléndez, Andrés Márquez Ruiz, Univ. de Alicante (Spain)
21 August 2024 • 10:55 AM - 11:15 AM PDT
Show Abstract + Hide Abstract
Digital backplane liquid crystal on silicon devices (LCoS) are widely used in wavefront engineering applications. A thorough study of the relation between the different control parameters and their effect on the retardance modulation is lacking though. These control parameters, which are the low and high levels of the voltage signal together with the grey level addressed, in relation with the retardance modulation produced are systematically explored across the whole visible spectrum. This methodology provides an interpolating approach to calculate the retardance values across the multidimensional control space once properly sampled. The results presented are backed by the experimental measurements provided.
13136-10
Author(s): Romil Audhkhasi, Univ. of Washington (United States); Michelle L. Povinelli, The Univ. of Southern California (United States)
21 August 2024 • 11:15 AM - 11:35 AM PDT
Show Abstract + Hide Abstract
We present an encryption system consisting of an electrically tunable metasurface and a matched detector for secure encryption of grayscale images in the 8 – 12 μm wavelength range. In the proposed scheme, the encrypted image corresponds to the spatially varying thermal intensity of the metasurface as captured by its matched detector. Using examples of single and multi-image encryption, we show that the optical properties of either the metasurface or matched detector alone do not reveal any meaningful information about the encrypted image, thereby validating the security of the proposed scheme. The electrical tunability of the metasurface provides an additional layer of security as the image can only be retrieved by operating it at a predefined voltage level. Our results provide intriguing possibilities for the development of compact and secure object tagging and anti-counterfeiting applications in the infrared.
Break
Lunch/Exhibition Break 11:35 AM - 1:05 PM
Session 3: Imaging Systems and Autonomous Navigation
21 August 2024 • 1:05 PM - 2:45 PM PDT
Session Chair: Andrés Márquez Ruiz, Univ. de Alicante (Spain)
13136-11
Author(s): Victor H. Diaz-Ramirez, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico); Leopoldo Gaxiola, Tecnológico Nacional de México, Instituto Tecnológico de Culiacan (Mexico); Juan Pablo Apun, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico); Rigoberto Juarez-Salazar, Consejo Nacional de Humanidades Ciencias y Tecnologías (Mexico), Instituto Politécnico Nacional (Mexico), Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico)
21 August 2024 • 1:05 PM - 1:25 PM PDT
Show Abstract + Hide Abstract
Disparity refinement is a post-processing step in stereo vision that retrieves unknown disparity values caused by pixel occlusions or estimation errors. This step is crucial for improving depth estimation accuracy and reducing artifacts. In this work, we propose an iterative method based on genetic optimization to perform disparity refinement for stereo vision. The estimation of unknown disparity values is formulated as an optimization problem, where a fitness function is optimized by minimizing a trade-off between disparity variations and point correspondence errors. The proposed method achieves accurate refined disparity maps for stereo depth estimation. Computer simulation results are presented and discussed in terms of objective performance measures. Additionally, the results are compared with those obtained using a well-known existing method.
13136-12
Author(s): Rigoberto Juarez-Salazar, Instituto Politécnico Nacional (Mexico); Eberto Benjumea, Andres G. Marrugo, Univ. Tecnológica de Bolívar (Colombia); Victor H. Diaz-Ramirez, Instituto Politécnico Nacional (Mexico)
21 August 2024 • 1:25 PM - 1:45 PM PDT
Show Abstract + Hide Abstract
Conventional fringe projection profilometers utilize cameras and projectors in the visible spectrum. Nevertheless, some applications require profilometers with a complementary thermal camera for the infrared spectrum. Since the point cloud is computed from pixel correspondences between the visible camera-projector pair, the texture in the visible spectrum is obtained by direct association of color from each image pixel to its corresponding point in the cloud. Unfortunately, the texture from the thermal camera is not straightforward because of the inexistence of pixel-point correspondences. In this paper, the distorted pinhole model is employed to determine the texture of the reconstructed object using thermal images. The theoretical principles are reviewed, and an experimental verification is conducted using a visible-thermal fringe projection profilometer. This work provides a helpful framework for three-dimensional data fusion for advanced multi-modal profilometers.
13136-13
Author(s): Victor H. Diaz-Ramirez, Martin Gonzalez-Ruiz, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico); Rigoberto Juarez-Salazar, Consejo Nacional de Humaidades Ciencias y Tecnologías (Mexico), Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico)
21 August 2024 • 1:45 PM - 2:05 PM PDT
Show Abstract + Hide Abstract
Binocular vision is an effective technique for depth estimation in outdoor applications. However, the performance of this technique can be affected by low-contrasted images captured in a scattering medium. This paper presents a binocular vision-based method for image dehazing and depth estimation in a scattering medium. First, the disparity map of the scene is computed from preprocessed binocular images. Next, the atmospheric parameters of the medium and scene depth are determined with a proposed statistical estimation method. Finally, undegraded images of the scene are obtained using an atmospheric optics restoration method. The theoretical foundations of the proposed approach are reviewed, and an experimental validation is presented utilizing a laboratory platform.
13136-14
Author(s): Armando Castillo Vazquez, Ulises Orozco-Rosas, Kenia Picos, CETYS Univ. (Mexico)
21 August 2024 • 2:05 PM - 2:25 PM PDT
Show Abstract + Hide Abstract
This paper presents the development of an autonomous navigation system for Unmanned Aerial Vehicles (UAVs) using visual reference. The proposal employs a Convolutional Neural Network (CNN) to classify traffic signal images, enabling UAVs to navigate in dynamically evolving environments. This research involves the configuration of the Robot Operating System (ROS) for UAV communication, the implementation of a specialized CNN for image classification, and the integration of this network into the navigation system. Additionally, algorithms will be presented for image acquisition and UAV manipulation based on CNN outputs. We present experimental results that were specially designed to demonstrate the efficiency of the proposal, in order to validate the analysis and implementation.
13136-15
Author(s): Alejandro Dumas Leon, Jorge Tomás Araujo González, Eduardo Arturo Mendoza Gomez, Ulises Orozco-Rosas, Kenia Picos, CETYS Univ. (Mexico)
21 August 2024 • 2:25 PM - 2:45 PM PDT
Show Abstract + Hide Abstract
This work presents the development of an autonomous mapping and navigation system tailored for a holonomic robot operating in unknown environments, leveraging LiDAR technology. The core of this research lies in the integration of a sophisticated Simultaneous Localization and Mapping (SLAM) algorithm with real-time LiDAR data processing, enabling the robot to generate maps while identifying its location within these maps accurately. The study involves the design and implementation of an omnidirectional mobility mechanism, the selection and calibration of LiDAR, and the implementation of efficient path-planning algorithms that consider the dynamic nature of unknown environments. We present experimental setups to demonstrate the system’s capability to navigate and map with high precision demonstrating the potential applications in areas such as autonomous exploration, search and rescue, and indoor navigation.
Break
Coffee Break 2:45 PM - 3:00 PM
Session 4: Algorithms and Imaging
21 August 2024 • 3:00 PM - 5:00 PM PDT
Session Chair: Abdul A. S. Awwal, Lawrence Livermore National Lab. (United States)
13136-16
Author(s): Daniel Alejandro Lopez Montiel, Miguel Angel Lopez-Montiel, Oscar Humberto Montiel Ross, Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico), Instituto Politécnico Nacional (Mexico); Oscar Castillo Lopez, Instituto Tecnológico de Tijuana (Mexico)
21 August 2024 • 3:00 PM - 3:20 PM PDT
Show Abstract + Hide Abstract
Integrating quantum algorithms with machine and deep learning models has emerged as a promising method for addressing medical image classification challenges. This addition can increase speed and efficiency when performing complex computations. However, hybrid quantum model research, specifically on Quantum Convolutional Neural Networks (QCNNs) poses two main drawbacks: The location of the quantum convolutional layer before the model architecture and the lack of integration of the quantum layer inside the training process. These disadvantages reduce model robustness and method reproducibility. This study explores an approach encompassing the model architecture’s quantum layer to address these shortcomings; we present a comparative analysis between a hybrid quantum deep learning model and its classical counterpart for classifying skin cancer dermoscopic images. Model performance is evaluated through accuracy, recall, F1 Score, and the confusion matrix classification metrics. The findings demonstrate the advantages of weight adjustment of a quantum layer and the potential for advancing medical image analysis.
13136-17
Author(s): Ivannia Gomez Moreno, Ulises Orozco-Rosas, Kenia Picos, CETYS Univ. (Mexico); Tajana Simunic-Rosing, Univ. of California, San Diego (United States)
21 August 2024 • 3:20 PM - 3:40 PM PDT
Show Abstract + Hide Abstract
This paper presents the implementation of a method that can effectively predict the best-fitting colorization model based on the input image. Altogether, this pipeline can produce a model with a high level of accuracy for several datasets. We show experimental results that were specially designed to demonstrate the efficiency of the model, in order to validate the analysis and implementation.
13136-18
Author(s): Martin Gonzalez-Ruiz, Victor H. Diaz Ramirez, Instituto Politécnico Nacional (Mexico); Miguel Cazorla, Univ. de Alicante (Spain); Rigiberto Juarez-Salazar, Instituto Politécnico Nacional (Mexico), Consejo Nacional de Humanidades Ciencias y Tecnologías (Mexico)
21 August 2024 • 3:40 PM - 4:00 PM PDT
Show Abstract + Hide Abstract
A crucial task in facial expression recognition is the classification of facial features in captured images. This classification task is challenging because facial features dynamically change due to facial expressions. Additionally, the captured face images are often degraded by additive noise, nonuniform illumination, geometrical modifications, and partial occlusions, increasing the classification uncertainty. Several successful methods for facial landmark classification based on machine learning have been proposed. This work presents a comparative study of existing classification methods for facial landmarks in image sequences degraded by noise, nonuniform illumination, and partial occlusions. The performance of the classification methods considered in the study is quantified in terms of accuracy, precision, and sensitivity using face images from wellknown datasets and images captured in a laboratory experiment. The study aims to provide useful insights into the efficacy of existing facial landmark classification methods under challenging conditions.
13136-19
Author(s): Arturo Rios Ramos, Mireya S. García-Vázquez, Alejandro A. Ramírez Acosta, Instituto Politécnico Nacional (Mexico)
21 August 2024 • 4:00 PM - 4:20 PM PDT
Show Abstract + Hide Abstract
This work introduces an AI-based approach for individual sea turtle identification using Photo-ID. Comparing a self-developed algorithm with the HotSpotter ID, a well-known method for semi-automated identification of individual sea turtles within populations, highlighting differences in image preprocessing and One-vs-Many matching evaluation. Tested on a diverse database of sea turtle images. The database images vary in environments, light conditions, qualities and resolutions, at different distances, and angles. The proposed Photo-ID method involves automatic detection of the turtle’s head using a modified YOLOv5 network, extraction of pixels to emphasize head scale patterns, and the ORB algorithm for identification. With standard quality images, the proposed algorithm achieves 98% accuracy, outperforming the HotSpotter’s 96%. This approach enhances the identification process, contributing to endangered species protection and addressing the challenges associated with the time-consuming nature of Photo-ID methods.
13136-20
Author(s): Juan Esteban García, Alejandro Restrepo-Martínez, Laura Álvarez-Gil, Univ. Nacional de Colombia Sede Medellín (Colombia)
21 August 2024 • 4:20 PM - 4:40 PM PDT
Show Abstract + Hide Abstract
Hyperthermia is a cancer treatment that utilizes magnetic nanofluidics and magnetic induction. The experimental evaluation techniques used to validate heat transfer and changes in thermal diffusivity include the use of phantom models to simulate human tissue and infrared thermography. The present study applies phantom models with magnetic inserts, which pose challenges in determining temperature distribution due to their location and depth, affect thermal contrast. Processing thermograms is crucial for identifying areas of interest, which enables the development of new hyperthermia evaluation techniques.
13136-21
Author(s): Daniel-Ruben Garcia-Avila, Ciro Andres Martinez-Garcia-Moreno, Sergio-Jesus Gonzalez-Ambriz, CITEDI-Instituto Politécnico Nacional (Mexico)
21 August 2024 • 4:40 PM - 5:00 PM PDT
Show Abstract + Hide Abstract
Water consumption monitoring is a critical task to address for controlling water usage and reducing unnecessary water waste. In this sense, water supply companies and government agencies struggle to identify water leaks and perform data analytics for predictive decision-making due to a lack of real-time monitoring systems, inaccurate metering, and aging infrastructure. In this paper, we propose an Internet of Things (IoT) and machine learning (ML)-based solution for residential water consumption monitoring and leak detection. The IoT network was implemented using two smart water meters (SWMs) connected to a LoraWAN gateway (GW), which sends data to The Thing Network (TTN) and the Amazon Web Services (AWS) cloud platform for storage, management, processing, ML algorithm configuration, and visualization. The SWMs were connected to residences located in two zones of the city of Tijuana. Different machine learning algorithms were analyzed and compared using different metrics such as Accuracy, Recall, Precision, and F1 Score for leak detection. A data visualization interface was developed to visualize the data and perform predictive analysis.
Featured Nobel Plenary
21 August 2024 • 5:00 PM - 5:45 PM PDT
Session Chair: Jennifer Barton, The Univ. of Arizona (United States)

5:00 PM - 5:05 PM:
Welcome and Opening Remarks
13115-501
The route to attosecond pulses (Plenary Presentation)
Author(s): Anne L'Huillier, Lund Univ. (Sweden)
21 August 2024 • 5:05 PM - 5:45 PM PDT
Show Abstract + Hide Abstract
When an intense laser interacts with a gas of atoms, high-order harmonics are generated. In the time domain, this radiation forms a train of extremely short light pulses, of the order of 100 attoseconds. Attosecond pulses allow the study of the dynamics of electrons in atoms and molecules, using pump-probe techniques. This presentation will highlight some of the key steps of the field of attosecond science.
Conference Chair
Old Dominion Univ. (United States)
Conference Chair
Lawrence Livermore National Lab. (United States)
Conference Chair
Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico)
Conference Co-Chair
Univ. de Alicante (Spain)
Program Committee
Univ. Autònoma de Barcelona (Spain)
Program Committee
Tsinghua Univ. (China)
Program Committee
Ctr. de Investigación y Desarrollo de Tecnología Digital (Mexico)
Program Committee
Lawrence Livermore National Lab. (United States)
Program Committee
Univ. of Denver (United States)
Program Committee
West Virginia Univ. (United States)
Program Committee
Univ. Autónoma de Baja California (Mexico)
Program Committee
Univ. of Florida (United States)
Program Committee
Utsunomiya Univ. Ctr. for Optical Research & Education (Japan)
Program Committee
Univ. Autònoma de Barcelona (Spain)
Additional Information
POST-DEADLINE ABSTRACTS ACCEPTED UNTIL 20 June
New submissions considered for poster session, or oral session if space becomes available
Contact author will be notified of acceptance by 8-July
View Submission Guidelines and Agreement
View the Call for Papers PDF

Submit Post-Deadline Abstract

What you will need to submit

  • Presentation title
  • Author(s) information
  • Speaker biography (1000-character max including spaces)
  • Abstract for technical review (200-300 words; text only)
  • Summary of abstract for display in the program (50-150 words; text only)
  • Keywords used in search for your paper (optional)
Note: Only original material should be submitted. Commercial papers, papers with no new research/development content, and papers with proprietary restrictions will not be accepted for presentation.