Jose Luis Flores Campana

Jose Luis Flores Campana

Ph.D. in Computer Science

University of Campinas

Biography

Jose Luis Flores is a highly accomplished software engineer and researcher specializing in Machine Learning, Deep Learning, and Image Processing. He holds a B.Sc. in Computer and Software Engineering from the University of San Antonio Abad de Cusco (UNSAAC), Peru, where his undergraduate research focused on the recognition and classification of hand gestures in sign language using advanced machine and deep learning techniques. He pursued his M.Sc. in Computer Science at the University of Campinas (Unicamp), Brazil, graduating in 2020. During this time, Jose collaborated with SAMSUNG Brasil and Unicamp on groundbreaking projects, including multilingual text detection and recognition in images and videos, as well as the generation of parallax motion effects. In 2024, Jose completed his Ph.D. at Unicamp, where his research centered on cutting-edge topics like Image Inpainting and Image Synthesis. Leveraging state-of-the-art Deep Learning models such as Generative Adversarial Networks (GANs) and Vision Transformers, he made significant contributions to the fields of computer vision and image processing. Currently, Jose works as a software engineer at Loggi, a leading logistics and technology company. At Loggi, he has contributed to numerous impactful projects spanning backend, frontend, and machine learning domains. His backend expertise includes the development of APIs, microservices, event-driven architectures, and cloud computing solutions using AWS services like S3 and sagemaker. On the frontend, he has developed robust user interfaces with technologies such as React, JavaScript, Node.js, HTML, and TypeScript. Jose’s work is characterized by his innovative approach to solving complex problems and his passion for leveraging technology to drive meaningful impact.

Jose Luis Flores é um engenheiro de software e pesquisador altamente realizado, especializado em Aprendizado de Máquina, Aprendizado Profundo (Deep Learning) e Processamento de Imagens. Ele possui um Bacharelado em Engenharia de Computação e Software pela Universidade de San Antonio Abad de Cusco (UNSAAC), Peru, onde sua pesquisa de graduação focou no reconhecimento e classificação de gestos manuais em linguagem de sinais, utilizando técnicas avançadas de aprendizado de máquina e aprendizado profundo. Ele concluiu seu Mestrado em Ciência da Computação na Universidade Estadual de Campinas (Unicamp), Brasil, em 2020. Durante esse período, Jose colaborou com a SAMSUNG Brasil e a Unicamp em projetos inovadores, incluindo a detecção e reconhecimento de texto multilíngue em imagens e vídeos, além da geração de efeitos de movimento paralaxe. Em 2024, Jose concluiu seu Doutorado na Unicamp, onde sua pesquisa se concentrou em tópicos de ponta, como Preenchimento de Imagens (Image Inpainting) e Síntese de Imagens. Utilizando modelos de Aprendizado Profundo de última geração, como Redes Adversárias Generativas (GANs) e Vision Transformers, ele fez contribuições significativas aos campos de visão computacional e processamento de imagens. Atualmente, Jose trabalha como engenheiro de software na Loggi, uma empresa líder em logística e tecnologia. Na Loggi, ele contribuiu para diversos projetos impactantes, abrangendo as áreas de backend, frontend e aprendizado de máquina. Sua experiência em backend inclui o desenvolvimento de APIs, microsserviços, arquiteturas orientadas a eventos e soluções de computação em nuvem usando serviços AWS, como o S3 e Sagemaker. No frontend, ele desenvolveu interfaces de usuário robustas com tecnologias como React, JavaScript, Node.js, HTML e TypeScript. O trabalho de Jose é caracterizado por sua abordagem inovadora para resolver problemas complexos e sua paixão por alavancar a tecnologia para gerar impacto significativo.

Download my resumé.

Interests
  • Pattern Recognition
  • Computer Vision
  • Image Processing
  • Image Synthesis
  • Machine Learning
  • Deep Learning
Education
  • Ph.D. in Computer Science, 2024

    University of Campinas (IC/Unicamp)

  • M.Sc. in Computer Science, 2020

    University of Campinas (IC/Unicamp)

  • B.Sc. in Computer Engineering, 2017

    University San Antonio Abad of Cusco (UNSAAC)

Skills

Git

70%

Github

70%

Docker

80%

Java

60%

Python

90%

c++-logo
C++

70%

pytorch-logo
Pytorch

70%

tensorflow-logo
Tensorflow

60%

keras-logo
Keras

60%

SQL-server-logo
SQL Server

70%

looker-logo
Looker

50%

grafana-logo
Grafana

70%

elastic-logo
Elastic Search

70%

postgreSQL-logo
PostgreSQL

60%

javascript-logo
Javascript

50%

react-logo
React

60%

Experience

 
 
 
 
 
1oggi
Software Engineering
Aug 2021 – Present São Paulo

Responsibilities include:

  • Designed and implemented a machine learning-based solution to automate damaged package declarations, reducing processing time by 10x compared to manual methods. The application utilized text detection for damage assessment, cloud storage via Amazon S3 for photo management, and was developed using Python and JavaScript.
  • Built and deployed microservices and APIs for Loggi’s desktop and mobile applications using Python, JavaScript, React, and Node.js. These services enable real-time tracking of packages across various statuses—such as in-progress, delivered, and damaged—helping the operations team optimize decision-making.
  • Developed an intelligent chatbot to handle common customer inquiries, including tracking missing or in-progress packages. The chatbot improved response times by 5x, significantly enhancing customer satisfaction and operational efficiency.
 
 
 
 
 
Unicamp
Researcher
Unicamp
Mar 2020 – Apr 2024 Campinas

Responsibilities included:

  • Proposed an advanced image inpainting model combining CNNs and transformers to effectively address challenges posed by large missing regions. The model delivered competitive performance compared to state-of-the-art methods.
  • Developed an innovative variable hyperparameter strategy for transformers, significantly reducing computational complexity. The proposed approach demonstrated a 3x improvement in efficiency over recent methods.
  • Introduced a novel image inpainting model that leverages auxiliary information from the pencil sketch domain to address structural and textural inconsistencies. Achieved state-of-the-art results on datasets such as CelebA and Paris StreetView, and competitive performance on the Places365 dataset.
 
 
 
 
 
Unicamp and Samsung Electronics America
Researcher
Unicamp and Samsung Electronics America
Apr 2020 – Jun 2021 Campinas

Responsibilities include:

  • Developed innovative algorithms using scene representations such as Layered Depth Images (LDI) and Multiplane Images (MPI) for generating parallax motion effects from a single image. Proposed a lightweight scene representation tailored for constrained devices like smartphones, achieving 3% more efficient compared to recent MPI-based methods. This work was published in a peer-reviewed paper.
  • Implemented and evaluated advanced image inpainting algorithms utilizing GANs and Vision Transformers. Achieved competitive results on the Places2 dataset and outperformed state-of-the-art methods by 2–3% on CelebA and Paris Street View datasets. These results demonstrated the model’s ability to effectively address complex inpainting challenges.
 
 
 
 
 
Unicamp and Samsung Electronics America
Researcher
Unicamp and Samsung Electronics America
Aug 2018 – Mar 2020 Campinas

Responsibilities include:

  • Implemented advanced post-processing algorithms to address challenges in text localization methods using Tesseract OCR, achieving a 4% improvement in accuracy.
  • Developed and evaluated multilingual text localization and recognition algorithms optimized for devices with low computational resources, ensuring efficient and accurate performance in constrained environments.
 
 
 
 
 
Unicamp
Researcher
Unicamp
Jun 2018 – Jan 2020 Campinas

Responsibilities included:

  • Implemented advanced post-processing algorithms to address challenges in text localization methods using Tesseract OCR, achieving a 4% improvement in accuracy.
  • Developed and evaluated multilingual text localization and recognition algorithms optimized for devices with low computational resources, ensuring efficient and accurate performance in constrained environments.
 
 
 
 
 
UNSAAC
Researcher
UNSAAC
Jun 2016 – Jun 2017 Cusco

Responsibilities include:

  • Researched and developed a hand gesture detection and classification system for sign language using CNN-based deep learning, achieving 96% accuracy compared to handcrafted methods under different environments.
  • Created a new hand gesture dataset with 10,000+ images, incorporating variations like rotation, translation, background changes, and noise train and test our hand gesture detection and classification method
 
 
 
 
 
Brain Systems
Software Engineering
Jan 2016 – Jun 2017 Cusco

Responsibilities include:

  • Designed and developed APIs and services for generating XML files and PDF reports as part of the electronic invoicing project (BS EFACT). Played a key role in establishing one of the first electronic invoicing solutions in Cusco, leveraging efficient SQL procedures with SQL Server to ensure robust and scalable performance.

Recent Publications

Quickly discover relevant content by filtering publications.
(2024). Image Inpainting on the Sketch-Pencil Domain with Vision Transformers. 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications.

Cite DOI Link

(2023). Variable-hyperparameter visual transformer for efficient image inpainting. Computers & Graphics.

Cite DOI Link

(2022). Multi-Scale Patch Partitioning for Image Inpainting Based on Visual Transformers. 35th Conference on Graphics, Patterns and Images.

Cite DOI Link

(2021). Adaptive Multiplane Image Generation From a Single Internet Picture. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

PDF Cite Link

(2021). Pyramidal Layered Scene Inference with Image Outpainting for Monocular View Synthesis. Computer Analysis of Images and Patterns.

PDF Cite

(2020). Parallax Motion Effect Generation Through Instance Segmentation And Depth Estimation. 2020 IEEE International Conference on Image Processing (ICIP).

PDF Cite Link

(2020). On the Fusion of Text Detection Results: A Genetic Programming Approach. IEEE Access.

PDF Cite DOI Link

(2020). MobText: A Compact Method for Scene Text Localization. Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP,.

PDF Cite DOI Link

(2019). Pelee-Text: A Tiny Convolutional Neural Network for Multi-oriented Scene Text Detection. 2019 18th IEEE International Conference On Machine Learning And Applications ( ICMLA).

PDF Cite DOI Link

(2019). Multi-Lingual Text Localization via Language-Specific Convolutional Neural Networks. Anais Estendidos da XXXII Conference on Graphics, Patterns and Images.

PDF Cite DOI Link

(2017). Application of convolutional neural networks for static hand gestures recognition under different invariant features. 2017 IEEE XXIV International Conference on Electronics, Electrical Engineering and Computing (INTERCON).

PDF Cite DOI Link

Contact