Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
preprocessing problem
#1
Hi, I'm new in data science and python, I have a project to deal with a data set which contains malware API calls and the data set just has one column
I want to find if any rows of the data set contains a all items which are in other list:
the first row of the dataset is :
data set = ["Worm.Win32.Zwr.c","009a83236c600fd7ac034973f064284cec62f86631fe96e900cb664f86061431","GetSystemDirectoryA","IsDBCSLeadByte","LocalAlloc","CreateSemaphoreW","CreateSemaphoreA","GlobalAddAtomW","lstrcpynW","LoadLibraryExW","SearchPathW","CreateFileW","CreateFileMappingW","MapViewOfFileEx","GetSystemMetrics","RegisterClipboardFormatW","SystemParametersInfoW","GetDC","GetDeviceCaps","ReleaseDC","LocalAlloc","GetSysColor","GetSysColorBrush","GetStockObject","GetSystemMetrics","LoadCursorW","RegisterClassW","RegisterClassExW","LoadLibraryExW","LoadLibraryW","GetCommandLineA","GetStartupInfoA","LockResource","GetModuleFileNameA","IsBadWritePtr","RegisterClipboardFormatW","SystemParametersInfoW","GetSystemMetrics","LocalAlloc","GetSysColor","GetSysColorBrush","GetStockObject","LoadLibraryW","LoadLibraryExW","LoadCursorW","RegisterClassW","GetKeyboardType","GetCommandLineA","GetStartupInfoA","GetVersion","GetModuleFileNameA","lstrcpynA","GetThreadLocale","GetLocaleInfoW","GetLocaleInfoA","lstrlenA","LoadLibraryExW","SearchPathW","FindResourceExW","LoadResource","LoadStringA","LocalAlloc","VirtualAllocEx","GetThreadLocale","GetLocaleInfoA","GetLocaleInfoW","EnumCalendarInfoA","CreateEventA","LoadLibraryExW","lstrcpyA","CompareStringA","lstrcmpA","WaitForSingleObjectEx","WaitForSingleObject","GetProcessVersion","GlobalAlloc","DuplicateHandle","WSAStartup","LoadLibraryExW","CreateSemaphoreA","CreateSemaphoreW","ReleaseSemaphore","WaitForSingleObject","WaitForSingleObjectEx","GetWindowsDirectoryW","LocalAlloc","FindFirstFileExW","FindFirstFileA","GetModuleFileNameA","CreateFileW","WriteFile","CopyFileExW","CopyFileA","OpenEventW","WaitForSingleObject","WaitForSingleObjectEx","LoadLibraryW","LoadLibraryExW","DuplicateHandle","DeviceIoControl","SwitchToThread","WSACleanup","FreeLibrary","VirtualQueryEx","ResetEvent","VirtualFreeEx","UnregisterClassW"]


the other list that I want to know if all of it's items is in each row of the dataset:
stealthiness = ['FindFirstFileA', 'FindNextFileA', 'GetProcAddress', 'LoadLibraryA', 'OpenProcess', 'Sleep']
Reply
#2
use intersect to find like items in list
example:
common = list(set(a) & set(b))
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  multi index issue of one hot encoder preprocessing aupres 0 1,057 Jun-10-2022, 11:23 AM
Last Post: aupres
  How to perform preprocessing for hyperspectral image vokoyo 0 1,909 Oct-05-2019, 02:11 PM
Last Post: vokoyo

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020