{"id":140,"date":"2016-11-22T16:39:14","date_gmt":"2016-11-22T13:39:14","guid":{"rendered":"http:\/\/blog.metu.edu.tr\/atemizel\/?page_id=140"},"modified":"2023-06-03T19:37:58","modified_gmt":"2023-06-03T16:37:58","slug":"datasets","status":"publish","type":"page","link":"https:\/\/blog.metu.edu.tr\/atemizel\/datasets\/","title":{"rendered":"Datasets"},"content":{"rendered":"<h2 style=\"text-align: center\"><a href=\"https:\/\/doi.org\/10.5281\/zenodo.5827695\">Labeled Images for Ulcerative Colitis (LIMUC) Dataset<\/a><\/h2>\n<div class=\"record-description\">\n<p>The LIMUC dataset comprises\u00a011276 images from 564 patients and 1043 colonoscopy procedures, who underwent colonoscopy for ulcerative colitis between December 2011 and July 2019 at the Department of Gastroenterology in Marmara University School of Medicine.\u00a0Two experienced gastroenterologists blindly reviewed and classified all images according to the endoscopic Mayo score (EMS). Images that the two reviewers labeled differently were labeled independently by a third experienced reviewer\u00a0who did not see the previous labels. The final EMS for these images was determined by majority voting.<\/p>\n<p><strong>Suggested Metrics<\/strong><\/p>\n<p>Since the classes (Mayo-0, Mayo-1, Mayo-2, Mayo-3) are imbalanced and ordinal, quadratic weighted kappa (QWK) can be used as the main performance metric.\u00a0QWK is a commonly used statistic for assessing agreement on an ordinal scale, and it is one of the best single performance metrics for this problem given the class imbalance. 
Mean absolute error (MAE), macro F1 score,\u00a0or macro accuracy can be used as\u00a0alternative performance metrics.<\/p>\n<p><strong>LIMUC Code Repository<\/strong><\/p>\n<p>Scripts for preprocessing and splitting the dataset, and for training and validation, are provided in\u00a0<a href=\"https:\/\/github.com\/GorkemP\/labeled-images-for-ulcerative-colitis\">this<\/a>\u00a0GitHub\u00a0repository.<\/p>\n<p><strong>Terms and Conditions<\/strong><\/p>\n<p>All documents and publications that use the LIMUC dataset or report experimental results based on it\u00a0should\u00a0include a citation to the dataset.<\/p>\n<\/div>\n<h2><a href=\"https:\/\/doi.org\/10.5281\/zenodo.7742659\">EOAD (Egocentric Outdoor Activity Dataset)<\/a><\/h2>\n<div class=\"record-description\">\n<p>EOAD is a collection of videos captured by wearable cameras, mostly of sports activities. It contains both visual and audio modalities.<\/p>\n<p>The selection of videos was based on the following criteria:<\/p>\n<ul>\n<li>The videos should not include text overlays.<\/li>\n<li>The videos should contain natural sound (no external music).<\/li>\n<li>The actions in the videos should be continuous (no scene cuts or jumps in time).<\/li>\n<\/ul>\n<p>Long videos (such as\u00a0<em>driving<\/em>,\u00a0<em>scuba diving<\/em>, and\u00a0<em>cycling<\/em>) were trimmed at scene changes. As a result, a video may have several clips depicting egocentric actions. Hence, video clips were extracted from carefully defined time intervals within videos. 
The final dataset includes video clips with a single action and natural audio information.<\/p>\n<p>Statistics for EOAD:<\/p>\n<ul>\n<li><strong>30<\/strong>\u00a0activities<\/li>\n<li><strong>303<\/strong>\u00a0distinct videos<\/li>\n<li><strong>1392<\/strong>\u00a0video clips<\/li>\n<li><strong>2243<\/strong>\u00a0minutes of labeled video clips<\/li>\n<\/ul>\n<\/div>\n<h2 style=\"text-align: center\"><a href=\"https:\/\/zenodo.org\/record\/3841039#.XtimCEUzbBU\">BreathBase: Intra-Speech Breathing Dataset<\/a><\/h2>\n<p>BreathBase contains 5070 breath instances detected in recordings of 20 participants reading pre-prepared random pseudo texts in 5 different postures, recorded simultaneously with 4 different microphones.<\/p>\n<p>It was recorded in a studio with a maximum background noise of 40 dB SPL and with professional recording equipment. It also provides tags for the 5 postures and 4 channels as different recording conditions for data variety.<\/p>\n<p>More than 90% of the recordings are shorter than 600 milliseconds. The minimum number of breath instances per participant is 89, the maximum is 710, and the average over all participants is 253.5.<\/p>\n<h2 style=\"text-align: center\"><strong><br \/>\nCamera Sabotage \/ Camera Tamper Detection Dataset<\/strong><\/h2>\n<p style=\"text-align: center\">Citation Information:<br \/>\nA. Saglam, A. Temizel, \u201cReal-time Adaptive Camera Tamper Detection for Video Surveillance\u201d, in Proceedings IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Sept. 
2009.<\/p>\n<div id=\"attachment_143\" style=\"width: 330px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blog.metu.edu.tr\/atemizel\/files\/2016\/11\/CamOutofFocus.gif\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-143\" class=\"wp-image-143 size-full\" src=\"https:\/\/blog.metu.edu.tr\/atemizel\/files\/2016\/11\/CamOutofFocus.gif\" alt=\"camoutoffocus\" width=\"320\" height=\"240\" \/><\/a><p id=\"caption-attachment-143\" class=\"wp-caption-text\"><a href=\"ftp:\/\/ftp.vrcv.ii.metu.edu.tr\/Datasets\/Camera%20Out%20of%20Focus\">Camera Out of Focus Videos<\/a><\/p><\/div>\n<div id=\"attachment_142\" style=\"width: 330px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blog.metu.edu.tr\/atemizel\/files\/2016\/11\/CamMoved.gif\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-142\" class=\"wp-image-142 size-full\" src=\"https:\/\/blog.metu.edu.tr\/atemizel\/files\/2016\/11\/CamMoved.gif\" alt=\"cammoved\" width=\"320\" height=\"240\" \/><\/a><p id=\"caption-attachment-142\" class=\"wp-caption-text\"><a href=\"ftp:\/\/ftp.vrcv.ii.metu.edu.tr\/Datasets\/Camera%20Moved\">Camera Moved Videos<\/a><\/p><\/div>\n<div id=\"attachment_141\" style=\"width: 330px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blog.metu.edu.tr\/atemizel\/files\/2016\/11\/CamCoveredByHand.gif\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-141\" class=\"wp-image-141 size-full\" src=\"https:\/\/blog.metu.edu.tr\/atemizel\/files\/2016\/11\/CamCoveredByHand.gif\" alt=\"camcoveredbyhand\" width=\"320\" height=\"240\" \/><\/a><p id=\"caption-attachment-141\" class=\"wp-caption-text\"><a href=\"ftp:\/\/ftp.vrcv.ii.metu.edu.tr\/Datasets\/Camera%20Covered\">Camera Covered Videos<\/a><\/p><\/div>\n<h2>Crowd Behaviour Analysis Dataset<\/h2>\n<p style=\"text-align: center\">Citation Information:<br \/>\nC. 
Ongun, A. Temizel, T. Taskaya Temizel, \u201cLocal Anomaly Detection in Crowded Scenes Using Finite-Time Lyapunov Exponent Based Clustering\u201d, IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Aug. 2014.<\/p>\n<div id=\"attachment_145\" style=\"width: 385px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-145\" class=\"wp-image-145 size-full\" src=\"https:\/\/blog.metu.edu.tr\/atemizel\/files\/2016\/11\/CrowdFight.gif\" alt=\"crowdfight\" width=\"375\" height=\"281\" \/><p id=\"caption-attachment-145\" class=\"wp-caption-text\"><a href=\"ftp:\/\/ftp.vrcv.ii.metu.edu.tr\/Datasets\/Crowd\/\">Crowd Anomaly Dataset<\/a><\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Labeled Images for Ulcerative Colitis (LIMUC) Dataset The LIMUC dataset comprises\u00a011276 images from 564 patients and 1043 colonoscopy procedures, who underwent colonoscopy for ulcerative colitis between December 2011 and July 2019 at the Department of Gastroenterology in Marmara University School of Medicine.\u00a0Two experienced gastroenterologists blindly reviewed and classified all images according to the endoscopic Mayo 
[&hellip;]<\/p>\n","protected":false},"author":2096,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":"","_links_to":"","_links_to_target":""},"class_list":["post-140","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/blog.metu.edu.tr\/atemizel\/wp-json\/wp\/v2\/pages\/140","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.metu.edu.tr\/atemizel\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/blog.metu.edu.tr\/atemizel\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/blog.metu.edu.tr\/atemizel\/wp-json\/wp\/v2\/users\/2096"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.metu.edu.tr\/atemizel\/wp-json\/wp\/v2\/comments?post=140"}],"version-history":[{"count":0,"href":"https:\/\/blog.metu.edu.tr\/atemizel\/wp-json\/wp\/v2\/pages\/140\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.metu.edu.tr\/atemizel\/wp-json\/wp\/v2\/media?parent=140"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}