About

I am currently a Ph.D. student in the School of Computer Science at Georgia Institute of Technology, advised by Prof. Ling Liu. I got my B.S. and M.Sc. in the Department of  Electrical and Electronics Engineering at Bilkent University. My research interest is to detect new concepts, irregularities, or regime changes in continuous data to enhance the performance, stability, and robustness of models actively. In that sense, I am working on few-shot learning in diverse context, including federated learning, natural language processing, and computer vision.

Previously, I worked as a full-time Data Scientist at Databoss Analytics Inc. from 2019-2022. My research was mainly on time series analysis and forecasting for large-scale real-life applications, where the focus of research shifted according to the complexity of the given problem, from traditional approaches to developing novel deep learning architectures. Specifically, I worked on crime, weather, and natural gas consumption prediction, where the problem contains spatial, temporal, and categorical covariates of multiple time series.

Interests

Deep Learning (Graph Neural Networks, Deep Probabilistic Models, Generative Models), Time series analysis and forecasting (Autoregressive models, Nonstationarity and normalization problems), Random Processes (Gaussian Process Regressions, HMMs, and CRFs), Machine Learning (Gradient Boosting, Feature Selection)

Additionally, I enjoy participating in various sports.

Publications

2021


Crime Prediction with Graph Neural Networks and Multivariate Normal Distributions

Selim Furkan Tekin and Suleyman S. Kozat, published on IEEE Transactions on Signal Processing (TSP)

Existing approaches to the crime prediction problem are unsuccessful in expressing the details since they assign the probability values to large regions. This paper introduces a new architecture with the graph convolutional networks (GCN) and multivariate Gaussian distributions to perform high-resolution forecasting that applies to any spatiotemporal data. We tackle the sparsity problem in high resolution by leveraging the flexible structure of GCNs and providing a subdivision algorithm. We build our model with Graph Convolutional Gated Recurrent Units (Graph-ConvGRU) to learn spatial, temporal, and categorical relations. In each node of the graph, we learn a multivariate probability distribution from the extracted features of GCNs. We perform experiments on real-life and synthetic datasets, and our model obtains the best validation and the best test score among the baseline models with significant improvements. We show that our model is not only generative but also precise.

Spatio-temporal Weather Forecasting and Attention Mechanism on Convolutional LSTMs

Selim Furkan Tekin, Oguzhan Karaahmetoglu, Fatih Ilhan, Ismail Balaban, and Suleyman S. Kozat, Second Round Submission to Monthly Weather Review (MWR)

Numerical weather forecasting on high-resolution physical models consume hours of computations on supercomputers. The application of deep learning and machine learning methods in forecasting revealed new solutions in this area. In this paper, we forecast high-resolution numeric weather data using both input weather data and observations by providing a novel deep learning architecture. We formulate the problem as spatio-temporal prediction. Our model is composed of Convolutional Long-short Term Memory, and Convolutional Neural Network units with an encoder-decoder structure. We enhance the short-long term performance and interpretability with attention and a context matcher mechanism. We perform experiments on a high-scale, real-life, benchmark numerical weather dataset, ERA5 hourly data on pressure levels, and forecast the temperature. The results show significant improvements in capturing both spatial and temporal correlations with attention matrices focusing on different parts of the input series. Our model obtains the best validation and the best test score among the baseline models, including ConvLSTM forecasting network and U-Net. We provide qualitative and quantitative results and show that our model forecasts 10-time steps with the 3-hour frequency with an average of 2 degrees error. Our code and the data are publicly available.

Conference Proceedings

2020


Selim F. Tekin and Bilgin Aksoy, Multi-step Spatio-Temporal Temperature Forecasting Presented in 2020 28th Signal Processing and Communications Applications Conference (SIU), [paper], [code], [presentation]

Selim F. Tekin, Selim F. Yilmaz, and Ismail Balaban, Deep Learning on Limited Turkish Data, Presented in 2020 28th Signal Processing and Communications Applications Conference (SIU), [poster], [code], [presentation]

Fatih Ilhan, Selim F. Tekin, and Bilgin Aksoy Spatio-Temporal Crime Prediction with Temporally Hierarchical Convolutional Neural Networks, Presented in 2020 28th Signal Processing and Communications Applications Conference, [paper]

O. Karaahmetoglu, Fatih Irım, and S. F. Tekin, Interpretable Classification And Regression For Time Series Data, Presented in 2020 28th Signal Processing and Communications Applications Conference (SIU) [paper]

Selim F. Yilmaz, Ismail Balaban, Selim F. Tekin, and Suleyman S. Kozat, Hybrid Framework for Named Entity Recognition in Turkish Social Media, Presented in 2020 28th Signal Processing and Communications Applications Conference (SIU), [paper]

Projects

2023


Wavelet decomposition-based Markov switching model

We introduce a neural network architecture to dynamically switch between regimes in time-series data by modeling low and high frequencies. We obtain the frequency components with the wavelet decomposition, which are indicators for different regimes in input data, and we learn the regime-switching with Hidden Markov Models. Each state has 1D Convolutional operations that generate a time-series representation. Finally, we take the weighted average of each representation with the likelihood of states to determine the next value on the series. We learn the model parameters, including the state transition matrix with MLE.

2021


Shazam Signature Representation of Audio for Full-text Documents

A text document search engine based on Shazam audio fingerprint algorithm was developed. It is capable of matching a query by using a fragment of the original document’s content. The Google Text to Speech API service, as well as a raw-to-audio method, are used to transform the text into audio. The audio fingerprints method of Shazam is used to generate the signature of a text from the audio file. The engine achieved a 60% match hit ratio when using queries of around 80% of the original file. The system is scalable; storage and computationally efficient as the search is performed using signature instead of the document itself.

2020


Forecasting of Tropical Cyclone Trajectories with Deep Learning

We study forecasting hurricane (tropical cyclone) trajectories using deep learning techniques. Hurricane trajectories exhibit highly complex and nonlinear behavior. Numerous factors such as landscape, atmospheric effects cause tropical cyclones to follow devious or unwavering routes. We employ a recurrent neural network (RNN)-based deep learning architecture called TrajGRU to capture these complex temporal patterns and perform forecasts. We analyze the performance in terms of kilometers at various hourly resolutions and compare the results with two baseline methods including naive linear predictor and long short-term memory (LSTM) networks. We demonstrate the performance gains and analyze the predictions.

Headline Generation from Distinctive Summaries

In this paper, we demonstrate an LSTM based encoder-decoder network for the headline generation task. We used the state-of-the-art BERT language model to extract context vectors. The attention mechanism is used together with the encoder-decoder network to capture relevancy information in the input sequence and improve overall system performance. Several summaries are generated and according to their TF-IDF score, the best summary is used in headline generation instead of evaluating the network on the whole content. Generated headlines are compared according to the BLEU metric.

Estimation of Facial Attractiveness Level

We design and train a deep neural network for facial attractiveness level estimation. In particular, we develop an architecture based on convolution, pooling, activation, and fully-connected layers. We analyze the performance of our model over a preprocessed facial image dataset with attractiveness level annotations. In this report, we describe our architecture, training procedure, and obtained results on the given dataset. The architecture can reach 0.42 and 0.50 mean absolute error in validation and test sets respectively.

2019


Image Captioning With Attention and Transfer Learning

The aim of the project is to generate meaningful sentences describing a given image, which is referred to as Image Captioning in the literature. Due to its real-life applications, various solutions and datasets related to the Image Captioning problem were published and are available online. In this report, we first describe the analyses performed on the dataset. Then, we describe our approach to the problem by describing our model architecture and data flow. At every step, we compare our approach with the current works in the literature and explain the reasoning behind our algorithmic decisions by stating the advantages and disadvantages for different possibilities. Lastly, we provide results that present the performance of our approach.