Textual intelligent Systems Lab

Paper
- Journal
- Conference
Books


Saedeh Tahery, Saeed Farzi. 2020. Neurological Disorders and Imaging Physics, Volume 4 (Chapter 2: Application of machine learning algorithms to diagnosis attention deficit hyperactivity disorder). IOP Publishing.


ReQuEST: A small-scale multi-task model for community question-answering systems (Journal: IEEE Access). 2024. Seyyede Zahra Aftabi, Seyyede Maryam Seyyedi, Mohammad Maleki, Saeed Farzi. [ Link ]
Managing multi-facet bias in collaborative filtering recommender systems. 2023. Samira Vaez Barenji, Saeed Farzi. Due to the extensive growth of information available online, recommender systems play a more significant role in serving people’s interests. Traditional recommender systems mostly use an accuracy-focused approach to produce recommendations. Today’s research suggests that this single-dimension approach can lead the system to be biased against a series of items with certain attributes. Biased recommendations across groups of items can endanger the interests of item providers along with causing user dissatisfaction with the system. This study aims to manage a new type of intersectional bias regarding the geographical origin and popularity of items in the output of state-of-the-art collaborative filtering recommender algorithms. We introduce an algorithm called MFAIR, a multi-facet post-processing bias mitigation algorithm to alleviate these biases. Extensive experiments on two real-world datasets of movies and books, enriched with the items’ continents of production, show that the proposed algorithm strikes a reasonable balance between accuracy and both types of the mentioned biases. According to the results, our proposed approach outperforms a well-known competitor with no or only a slight loss of efficiency. [ Link ]
Fraud detection in financial statements using data mining and GAN models (Journal: Expert Systems with Applications). 2023. Seyyede Zahra Aftabi, Ali Ahmadi, Saeed Farzi. Financial statements are analytical reports published periodically by financial institutions explaining their performance from different perspectives. As these reports are the fundamental source for decision-making by many stakeholders, creditors, investors, and even auditors, some institutions may manipulate them to mislead people and commit fraud. Fraud detection in financial statements aims to discover anomalies caused by these distortions and discriminate fraud-prone reports from non-fraudulent ones. Although binary classification is one of the most popular data mining approaches in this area, it requires a standard labeled dataset, which is often unavailable in the real world due to the rarity of fraudulent samples. This paper proposes a novel approach based on the generative adversarial networks (GAN) and ensemble models that is able to not only resolve the lack of non-fraudulent samples but also handle the high-dimensionality of feature space. A new dataset is also constructed by collecting the annual financial statements of ten Iranian banks and then extracting three types of features suggested in this study. Experimental results on this dataset demonstrate that the proposed method performs well in generating synthetic fraud-prone samples. Moreover, it attains comparative performance with supervised models and better performance than unsupervised ones in accurately distinguishing fraud-prone samples. [ Link ]
Multi-Module G2P Converter for Persian Focusing on Relations between Words. 2022. Mahdi Rezaei; Negar Nayeri; Saeed Farzi; Hossein Sameti. In this paper, we investigate the application of end-to-end and multi-module frameworks for G2P conversion for the Persian language. The results demonstrate that our proposed multi-module G2P system outperforms our end-to-end systems in terms of accuracy and speed. The system consists of a pronunciation dictionary as our look-up table, along with separate models to handle homographs, OOVs and ezafe in Persian created using GRU and Transformer architectures. The system is sequence-level rather than word-level, which allows it to effectively capture the unwritten relations between words (cross-word information) necessary for homograph disambiguation and ezafe recognition without the need for any pre-processing. After evaluation, our system achieved a 94.48% word-level accuracy, outperforming the previous G2P systems for Persian. [ Link ]
A GA-based algorithm meets the fair ranking problem (Journal: Information Processing & Management). 2021. Saedeh Tahery, Seyyede Zahra Aftabi, Saeed Farzi. Ranking items is a vital component in almost every application dealing with selecting the most suitable items among a pool of candidates. Yet, specific individuals or groups may be systematically disadvantaged in getting the opportunity of appearing on the ranking list. The fair ranking problem aims at mitigating the bias imposed on protected groups (i.e., disadvantaged groups) while preserving the total quality of the ranking list as high as possible. FAIR is one of the existing algorithms, which finds the exact solutions for only one protected group, considering a given minimum number of protected items at every prefix of a ranking list. However, when an item belongs to more than two protected groups achieving optimal solutions gets more difficult. This paper proposes an algorithm called FARGO, a fair ranking algorithm based on the genetic algorithm (GA) enhanced by the simulated annealing (SA) that is able to handle any number of protected groups. A new objective function is also proposed by incorporating the main goals of the problem, which is utilized as FARGO’s fitness function. Furthermore, a novel evaluation metric named Expected Gain Ratio (EGR) is introduced to assess a fair ranking algorithm’s output. Experimental results on real-world datasets demonstrate that FARGO attains comparative performance with FAIR for one protected group and finds near-optimal solutions for more than one protected group in terms of NDCG and EGR. Note that involving other concepts such as exposure is not a matter of this paper and can be an interesting subject for further studies. [ Link ]
An enhanced personality detection system through user’s digital footprints (Journal: Digital Scholarship in the Humanities). 2021. Mohammad Mobasher, Saeed Farzi. One of the most important aspects of any person’s life is personality, which affects one’s speech, decision, well-being, feeling and mental health. Personality detection is usually based on data collected by a questionnaire that comprises some critical problems such as the lack of direct access to the individuals and explicit personal information. However nowadays, one of the valuable resources for such studies is social networks. The footprint and tracking of users on social networks have provided valuable information for personality recognition. Specifically, this research introduces an intelligence personality recognition system based on modeling user behavior using sophisticated features, i.e., Statistical, Emotional, and Linguistic. Furthermore, a dataset called KNTU_Personality based on the MBTI personality model with the profile information and tweets has been collected. The experimental study follows two scenarios with complementing objectives. First, the sensitivity analysis is performed respecting to setting parameters, introduced features and different learning algorithms. Next, the proposed system has been compared with well-known personality detection systems. The results demonstrate the superiorities of the proposed system regarding its counterparts in terms of F-Score, Precision, Recall and Accuracy. [ Link ]
A New Clustering Algorithm for Attributive Graphs through Information Diffusion Approaches (Journal: Journal of Electrical and Computer Engineering Innovations (JECEI)). 2020. Sahar Kianian, Saeed Farzi, Hamed Samak. Background and Objectives: Simplicity and flexibility constitute the two basic features for graph models which has made them functional models for real-life problems. The attributive graphs are too popular among researchers because of their efficiency and functionality. An attributive graph is a graph of the nodes and edges of which can be attributive. Nodes and edges as structural dimension and their attributes as contextual dimension made graphs more flexible in modeling real problems. Methods: In this study, a new clustering algorithm is proposed based on K-Medoid which focuses on graph’s structure dimension, through heat diffusion algorithm and contextual dimension through weighted Jaccard coefficient in a simultaneous matter. The calculated clusters through the proposed algorithm are of denser and nodes with more similar attributes. Results: DBLP and PBLOG real data sets are applied to evaluate and compare this algorithm with new and well-known cluster algorithms. Conclusion: Results indicate the outperformers of this algorithm in relation to its counterparts as to structure quality, cluster contextual and time complexity criteria. [ Link ]
TIPS: Time-aware Personalised Semantic-based query auto-completion (Journal: Journal of Information Science). 2020. Saedeh Tahery, Saeed Farzi. With the rapid growth of the Internet, search engines play vital roles in meeting the users’ information needs. However, formulating information needs to simple queries for canonical users is a problem yet. Therefore, query auto-completion, which is one of the most important characteristics of the search engines, is leveraged to provide a ranked list of queries matching the user’s entered prefix. Although query auto-completion utilises useful information provided by search engine logs, time-, semantic- and context-aware features are still important resources of extra knowledge. Specifically, in this study, a hybrid query auto-completion system called TIPS (Time-aware Personalised Semantic-based query auto-completion) is introduced to combine the well-known systems performing based on popularity and neural language model. Furthermore, this system is supplemented by time-aware features that blend both context and semantic information in a collaborative manner. Experimental studies on the standard AOL dataset are conducted to compare our proposed system with state-of-the-art methods, that is, FactorCell, ConcatCell and Unadapted. The results illustrate the significant superiorities of TIPS in terms of mean reciprocal rank (MRR), especially for short-length prefixes. [ Link ]
An Ensemble Click Model for Web Document Ranking (Journal: International Journal of Engineering (IJE)). 2020. Danial Bidekani Bakhtiarvand, Saeed Farzi. Annually, web search engine providers spend a lot of money on re-ranking documents in search engine result pages (SERP). Click models provide advantageous information for re-ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to predict users’ clicks on SERPs simultaneously, the first module tries to predict users’ click behaviors using Probabilistic Graphical Models, the second module is a Time-series Deep Neural Click Model which predicts users’ clicks on documents and finally, the third module is a similarity-based measure which creates a graph of document-query relations and uses SimRank Algorithm to predict the similarity. After running these three simultaneous processes, three click probability values are fed to an MLP classifier as inputs. The MLP classifier learns to decide on top of the three preceding modules, then it predicts a probability value which shows how probable a document is to be clicked by a user. The proposed system is evaluated on the Yandex dataset as a standard click log dataset. The results demonstrate the superiority of our model over the well-known click models in terms of perplexity. [ Link ]
C-Blondel: An Efficient Louvain-Based Dynamic Community Detection Algorithm (Journal: IEEE Transactions on Computational Social Systems). 2020. Mahsa Seifikar, Saeed Farzi, Masoud Barati. One of the most interesting topics in the scope of social network analysis is dynamic community detection, keeping track of communities’ evolutions in a dynamic network. This article introduces a new Louvain-based dynamic community detection algorithm relied on the derived knowledge of the previous steps of the network evolution. The algorithm builds a compressed graph, where its supernodes represent the detected communities of the previous step and its superedges show the edges among the supernodes. The algorithm not only constructs the compressed graph with low computational complexity but also detects the communities through the integration of the Louvain algorithm into the graph. The efficiency of the proposed algorithms is widely investigated in this article. By doing so, several evaluations have been performed over three standard real-world data sets, namely Enron Email, Cit-HepTh, and Facebook data sets. The obtained results indicate the superiority of the proposed algorithm with respect to the execution time as an efficiency metric. Likewise, the results show the modularity of the proposed algorithm as another effectiveness metric compared with the other well-known related algorithms. [ Link ]
MDPCluster: a swarm-based community detection algorithm in large-scale graphs (Journal: Computing). 2020. Mahsa Fozuni Shirjini, Saeed Farzi, Amin Nikanjam. Social network analysis has become an important topic for researchers in sociology and computer science. Similarities among individuals form communities as the basic constitutions of social networks. Regarding the importance of communities, community detection is a fundamental step in the study of social networks typically modeled as large-scale graphs. Detecting communities in such large-scale graphs which generally suffers from the curse of dimensionality is the main objective followed in this study. An efficient modularity-based community detection algorithm called MDPCluster is introduced in order to detect communities in large-scale graphs in a timely manner. To address the high dimensionality problem, first, a Louvain-based algorithm is utilized by MDPCluster to distinguish initial communities as super-nodes and then a Modified Discrete Particle Swarm Optimization algorithm, called MDPSO is leveraged to detect communities through maximizing modularity measure. MDPSO discretizes Particle Swarm Optimization using the idea of transmission tendency and also escapes from premature convergence thereby a mutation operator inspired by Genetic Algorithm. To evaluate the proposed method, six standard datasets, i.e., American College Football, Books about US Politics, Amazon Product Co-purchasing, DBLP, GR-QC and HEP-TH have been employed. The first two are known as synthetic datasets whereas the rest are real-world datasets. In comparison to eight state-of-the-art algorithms, i.e., Stationary Genetic Algorithm, Generational Genetic Algorithm, Simulated Annealing-Stationary Genetic Algorithm, Simulated Annealing-Generational Genetic Algorithm, Grivan–Newman, Danon and Label Propagation Algorithm, the results indicate the superiorities of MDCluster in terms of modularity, Normalized Mutual Information and execution time as well. [ Link ]
Customized query auto-completion and suggestion — A review (Journal: Information Systems). 2020. Saedeh Tahery, Saeed Farzi. Nowadays, with the widespread use of the internet, users meet their information needs with the help of search engines. Users tend to retrieve the most relevant results by entering short phrases in the search engines. Customizing the retrieved results helps attain this goal. In this study, research works in the fields of query suggestion, particularly query auto-completion have been studied with special attention to customization. First, the sophisticated customizing features were classified into four dimensions: time, location, context, and demographic features. Then, related works were investigated regarding algorithm, dataset and evaluation measures. Regarding the literature, we found that the research works employing context or time as sophisticated features for customization are more than those using location or demographic features. While the location dimension has been recently taken into consideration, using other dimensions has a long background. Moreover, in the related works, the AOL dataset and Mean Reciprocal Rank (MRR) are known as the most frequent dataset and evaluation measure, respectively. [ Link ]
Assessment of Customer Credit Risk using an Adaptive Neuro-Fuzzy System (Journal: Journal of Computer and Knowledge Engineering). 2019. Sahar Kianian, Saeed Farzi. Given the financial crises in the world, one of the most important issues of banking industry is the assessment of customers’ credit to distinguish bad credit customers from good credit customers. The problem of customer credit risk assessment is a binary classification problem, which suffers from the lack of data and sophisticated features as main challenges. In this paper, an adaptive neuro-fuzzy inference system is exploited to tackle the customer credit risk assessment problem regarding the mentioned challenges. First of all, a SOMTE-based algorithm is introduced to overcome the data imbalancing problem. Then, several efficient features are identified using a MEMETIC metaheuristic algorithm, and finally an adaptive neuro-fuzzy system is exploited for distinguishing bad credit customers from good ones. To evaluate and compare the performance of the proposed system, the standard German credit data dataset and the well-known classification algorithms are utilized. The results indicate the superiority of the proposed system compared to some well-known algorithms in terms of precision, accuracy, and Type II errors. [ Link ]
Designing and Implementing an Emotion Analytic System (EAS) on Instagram Social Network Data (Journal: International Journal of Web Research). 2019. Seyed Faridoddin Kiaei, Mohammad Dehghan Rouzi, Saeed Farzi. Being aware of people’s attitudes and emotions about a specific person or an event can have a high impact on the decisions of individuals and organizations. With the rise of social networks, specifically Instagram, many people are sharing their attitudes on this social network. Analyzing the emotions of users of this social network can help managers make organizational decisions and predict essential events such as elections. In this research, the EAS system designed and implemented to extract emotions and visualize them. As a practical example, the Instagram users’ feelings about the two main candidates for the 12th Iranian presidential election also examined. The data were Instagram Persian comments collected using a developed crawler. The result shows a more positive feeling about Rouhani in comparison with Raeisi. Also, the lexicon-based analysis of Rouhani revealed a high level of trust emotion, along with anger and disgust. The crawled and preprocessed dataset is publicly available at https://github.com/sfdk74/EAS. [ Link ]
A neural reordering model based on phrasal dependency tree for statistical machine translation (Journal: Intelligent Data Analysis). 2018. Saeed Farzi, Heshaam Faili, Sahar Kianian. Machine translation is an important field of research and development. Word reordering is one of the main problems in machine translation. It is an important factor of quality and efficiency of machine translations and becomes more difficult when it deals with structurally divergent language pairs. To overcome this problem, we introduce a neural reordering model, using phrasal dependency trees which depict dependency relations among contiguous non-syntactic phrases. The model makes the use of reordering rules, which are automatically learned by a probabilistic neural network classifier from a reordered phrasal dependency tree bank. The proposed model combines the power of the lexical reordering and syntactic pre-ordering models by performing long-distance reorderings. The proposed reordering model is integrated into a standard phrase-based statistical machine translation system to translate input sentences. Our method is evaluated on syntactically divergent language-pairs, English → Persian and English → German using WMT07 benchmark. The results illustrate the superiority of the proposed method in terms of BLEU, TER and LRscore on both translation tasks. On average the proposed method retrieves a significant impact on precision and recall values respect to the hierarchical, lexicalized and distortion reordering models. [ Link ]
A novel clustering algorithm for attributed graphs based on K-medoid algorithm (Journal of Experimental & Theoretical Artificial Intelligence). 2018. Saeed Farzi, Sahar Kianian. Articulateness and plasticity are two essential attributes that make a graph as an efficient model to real life problems. Nowadays, the attributed graph is received lots of attentions because of usability and effectiveness. In this study, a novel k-Medoid based clustering algorithm, which focuses simultaneously on both structural and contextual aspects using Signal and the weighted Jaccard similarities, are introduced. Two real life data-sets, Political Blogs and DBLP bibliography, are employed in order to evaluate and compare the proposed algorithm with state-of-the-art clustering algorithms. [ Link ]
A preordering model based on phrasal dependency tree (Journal: Digital Scholarship in the Humanities). 2018. Saeed Farzi, Heshaam Faili, Sahar Kianian. Intelligent machine translation (MT) is becoming an important field of research and development as the need for translations grows. Currently, the word reordering problem is one of the most important issues of MT systems. To tackle this problem, we present a source-side reordering method using phrasal dependency trees, which depict dependency relations between contiguous non-syntactic phrases. Reordering elements are automatically learned from a reordered phrasal dependency tree bank and are utilized to produce a source reordering lattice. The lattice finally is decoded by a monotone phrase-based SMT to translate a source sentence. The approach is evaluated on syntactically divergent language pairs, ie English to Persian and English to German, using the workshop of machine translation 2007 (WMT07) benchmark. [ Link ]
Katibeh: A Persian news summarizer using the novel semi-supervised approach (Journal: Digital Scholarship in the Humanities). 2018. Saeed Farzi, Sahar Kianian. Nowadays, text summarization is one of the most important active research fields in information retrieval. The most of the supervised extractive summarization systems utilize learning-to-rank methods to score sentences according to their importance. They need a high-quality comprehensive summarization corpus, which is labeled manually by human experts. Unfortunately, this sort of corpus is not available for most low-resource languages such as Persian. In this study, first of all, a comprehensive human-labeled summarization corpus (called Bistoon) collected by the crowdsourcing approach is introduced, and then a Persian summarizer based on a novel semi-supervised summarization approach, which is a combination of co-training and self-training, is presented to overcome the absence of sufficient data. [ Link ]
A comprehensive study of online event tracking algorithms in social networks (Journal of Information Science). 2018. Mahsa Seifikar, Saeed Farzi. Recently, social networks have provided an important platform to detect trends of real-world events. The trends of real-world events are detected by analysing flow of massive bulks of data in continuous time steps over various social media platforms. Today, many researchers have been interested in detecting social network trends, in order to analyse the gathered information for enabling users and organisations to satisfy their information need. This article is aimed at complete surveying the recent text-based trend detection approaches, which have been studied from three perspectives (algorithms, dimension and diversity of events). The advantages and disadvantages of the considered approaches have also been paraphrased separately to illustrate a comprehensive view of the previous works and open problems. [ Link ]


DIPT: Diversified Personalized Transformer for QAC systems (2023 13th International Conference on Computer and Knowledge Engineering (ICCKE)). 2023. Mahdi Dehghani, Samira Vaez Barenji, Saeed Farzi. Today, with the explosion of information on the web, search engines play a more prominent role in serving users’ information needs. Query auto-completion (QAC) is one of the most crucial aspects of search engines that helps users formulate relevant and precise queries based on their information needs. A QAC system generates a list of query candidates according to the user’s provided prefix and then updates it with each new keystroke. The existing methods mostly focus on personalizing query candidates to make them revolve around the user’s interests and past interactions. Due to the limited number of suggested queries, mere personalization can result in pushing redundant suggestions into the list as well as the exclusion of effective ones. In this paper, we address the diversification task in QAC systems, presenting a novel method called DIPT, a Diversified Personalized Transformer for QAC systems. The proposed method diversifies the suggested queries to include the potential future interests of users in addition to their past interests and interactions. Experimental results on the AOL standard dataset demonstrate the advantage of DIPT over state-of-the-art personalized QAC systems in terms of the MRR(Mean Reciprocal Rank) criterion. [ Link ]
Dynamic knowledge graph completion through time-aware relational message passing (2023 28th International Computer Conference, Computer Society of Iran (CSICC)). 2023. Amirhossein Baqinejadqazvini, Saedeh Tahery, Saeed Farzi. As the structure of knowledge graphs may vary over time, static knowledge graph completion methods do not deal with time-varying knowledge graphs. However, examining the paths between entities and entities’ context information can lead to more accurate completion methods. This paper attempts to complete dynamic (time-varying) knowledge graphs by combining time-aware relational paths and relational context. The proposed model can improve dynamic knowledge graph completion methods by leveraging neural networks. Experimental results conducted on two standard datasets, ICEWS14 and ICEWS05-15, indicate our model’s superiority in terms of Mean Reciprocal Rank (MRR) and Hit@k over its well-known counterparts, such as DE-TransE and DE-DistMult. [ Link ]
Effective synthetic data generation for fake user detection (2021 26th International Computer Conference, Computer Society of Iran (CSICC)). 2021. Arefeh Esmaili, Saeed Farzi. Nowadays, with the pervasiveness of social networks among people, the possibility of publishing incorrect information has increased more than before. Therefore, detecting fake news and users who publish this incorrect information is of great importance. This paper has proposed a system based on combining context-user and context-network features with the help of a conditional generative adversarial network for balancing the data set to detect users who publish incorrect information in the Persian language on Twitter. Moreover, by conducting numerous experiments, the proposed system in terms of evaluation metrics compared to its competitors, has produced good performance results in detecting fake users. [ Link ]
A hybrid click model for web search (2019 5th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS)). 2019. Danial Bidekani Bakhtiarvand, Saeed Farzi. Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantegeous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, a hybrid click model is introduced by combining a PGM-based and a neural network click model. Hybrid click model tries to predict users’ clicks behavior on the documets which are represented in SERPs. Indeed, a weighted k-nearest neighbors has been employed to provide final decision based on UBM and LSTM click models scores. The proposed system is evaluated on the Yandex dataset as a standard click log data set. The results demonstrate the superiority of our model over the state-of-the-art click models in terms of perplexity. [ Link ]
Persian Rumor Detection on Twitter (2018 9th International Symposium on Telecommunications (IST)). 2018. Sajjad Dehghani Mahmoodabad, Saeed Farzi, Danial Bidekani Bakhtiarvand. Nowadays, one of the common ways of news broadcasting is sharing news through online social media. Some users consider these social media as a platform for news broadcasting. Every day numerous news transfer among users. However, sometimes rumors get around between users, such that they may make some mistakes about what are exactly happened. If rumors has been recognized at the right time, their negative effects can be bounded. In order to differentiate between rumors and non-rumors tweets, various well-known machine learning methods are applied on KNTUPT dataset which is collected all persian tweets from November 24th, 2017 to December 8th, 2017. The results indicate that the Random forest and meta.RandomSubSpace show their superiority than other methods. [ Link ]
Kermanshah Earthquake Event Tracking Through Persian Tweets (2018 9th International Symposium on Telecommunications (IST)). 2018. Mahsa Seifikar, Saeed Farzi, Sajjad Dehghani Mahmoodabad. By expanding online social networks during the last decade, online event tracking has become one of the vital issues among researchers. In this study, Persian tweets which exchange among Iranian users have been considered in order to track hidden patterns of evolutionary Kermanshah earthquake event. A well-known dynamic community detection algorithm has been applied over key-graphs which are generated from extracted keywords of tweets. The key-graphs are generated from KNTUPT dataset which consists of entire Persian tweets between November 24th, 2017 and December 8th, 2017. The experimental study reveals the dynamic community detection algorithm had a superior result in terms of modularity as community quality and execution time as performance. [ Link ]
Community detection in social networks using a novel algorithm without parameter ( 2017 IEEE 4th International Conference on Knowledge-Based Engineering and Innovation (KBEI)). 2017. Binazir Balegh, Saeed Farzi. With regard to the expansion of networks like computer network and virtual social networks, many researchers have been drawn to the analysis of social networks. Community detection in social networks is one of the most important issues in this domain. In recent years, numerous algorithms have been introduced in order to detect communities. In the present study, by determining the number of clusters and specifying initial centres of clusters, a novel algorithm has been introduced. By defining the neighbourhood of nodes, the proposed algorithm selects the initial centres of clusters in a way that the centres are at the maximum distance from each other. [ Link ]
TPC: An Automatically Generated Comprehensive English-Persian Parallel Corpus (2017 5th International Symposium on Computational and Business Intelligence (ISCBI)). 2017. Saeed Farzi, Heshaam Faili. Nowadays, Parallel corpus is one of the most important resources which can be employed in different researches such as machine translation, bilingual lexicography, and linguistics. This paper describes the process of building a large-scale (about 400, 000 sentence pairs) English-Persian parallel corpus called Tehran Parallel Corpus (TPC). The aim of study is to introduce the structure and explain the materials utilized for constructing TPC. In addition, some useful tools developed within the project have been introduced and three sorts of the statistical machine translation systems trained by TPC have been considered. In order to develop a high quality parallel corpus, unsure alignments recognized via a MaxEnt classifier have been eliminated from the corpus. [ Link ]
Diagnosis of attention deficit hyperactivity disorder using deep belief network based on greedy approach (2017 5th International Symposium on Computational and Business Intelligence (ISCBI)). 2017. Saeed Farzi, Sahar Kianian, Ilnaz Rastkhadive. Attention deficit hyperactivity disorder creates conditions for the child as s/he cannot sit calm and still, control his/her behavior and focus his/her attention on a particular issue. Five out of every hundred children are affected by the disease. Boys are three times more than girls at risk for this complication. The disorder often begins before age seven, and parents may not realize their children problem until they get older. Children with hyperactivity and attention deficit are at high risk of conduct disorder, antisocial personality, and drug abuse. Most children suffering from the disease will develop a feeling of depression, anxiety and lack of self-confidence. Given the importance of diagnosis the disease, Deep Belief Networks (DBNs) were used as a deep learning model to predict the disease. [ Link ]
Predicting serious diabetic complications using hidden pattern detection (2017 IEEE 4th International Conference on Knowledge-Based Engineering and Innovation (KBEI)). 2017. Saeed Farzi, Sahar Kianian, Ilnaz Rastkhadive. Diabets is known as a metabolic disease. Type 1 diabetes is anti-immune disease whereby the body’s immune system kills off its own insulin producing beta cells in the pancreas. Type 2 diabetes is an advanced state of health in which the body becomes opposed to the usual impacts of insulin and/or progressively loses the capacity to produce adequate amount insulin in the pancreas, and it finally may not be able to produceany insulin. Type 2 diabetes is the most common form of the disease with complications including heart, vision, and foot conditions. This study aimed at predicting some important complications of Type 2 diabetes such as heart disease, retinopathy, diabetic foot, neuropathy, and nephropathy in order to provide useful information for patients. In this paper, inaddition to introduce verity of sophisticated features forpredicting Type 2 diabetics complications, a new dataset aiming at predicting Type 2 diabetics complications has been collected. [ Link ]