StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POClass using std::vector allocation slower than pointer allocation by a LOT
primarykey
Id
20863478
data
AcceptedAnswerId
20863543
AnswerCount
3
ClosedDate
CommentCount
3
CommunityOwnedDate
CreationDate
2013-12-31T21:42:26.807
FavoriteCount
1
LastActivityDate
2014-01-01T00:32:51.990
LastEditDate
LastEditorUserId
0
OwnerUserId
1462718
ParentId
0
PostTypeId
1
Score
1
ViewCount
351
LastEditorDisplayName
text
Body
I have a class for handling allocation of arrays. My class is simple and it is defined as follows: DimArray.hpp: <pre><code>#ifndef DIMARRAY_HPP_INCLUDED #define DIMARRAY_HPP_INCLUDED #include <vector> template<typename T> class DimArray { private: int Width, Height; std::vector<T> Data; public: DimArray(int Width, int Height); DimArray(T* Data, int Width, int Height); DimArray(T** Data, int Width, int Height); DimArray(const DimArray &da); DimArray(DimArray &&da); inline int size() {return Width * Height;} inline int size() const {return Width * Height;} inline int width() {return Width;} inline int width() const {return Width;} inline int height() {return Height;} inline int height() const {return Height;} inline T* operator [](int Index) {return const_cast<T*>(Data.data()) + Height * Index;} inline const T* operator [](int Index) const {return Data.data() + Height * Index;} DimArray& operator = (DimArray da); }; template<typename T> DimArray<T>::DimArray(int Width, int Height) : Width(Width), Height(Height), Data(Width * Height, 0) {} template<typename T> DimArray<T>::DimArray(T* Data, int Width, int Height) : Width(Width), Height(Height), Data(Width * Height, 0) {std::copy(Data, Data + Width * Height, const_cast<T*>(this->Data.data()));} template<typename T> DimArray<T>::DimArray(T** Data, int Width, int Height) : Width(Width), Height(Height), Data(Width * Height, 0) {std::copy(Data[0], Data[0] + Width * Height, const_cast<T*>(this->Data.data()));} template<typename T> DimArray<T>::DimArray(const DimArray &da) : Width(da.Width), Height(da.Height), Data(da.Data) {} template<typename T> DimArray<T>::DimArray(DimArray &&da) : Width(std::move(da.Width)), Height(std::move(da.Height)), Data(std::move(da.Data)) {} template<typename T> DimArray<T>& DimArray<T>::operator = (DimArray<T> da) { this->Width = da.Width; this->Height = da.Height; this->Data.swap(da.Data); return *this; } #endif // DIMARRAY_HPP_INCLUDED </code></pre> For benchmark timing, I use the following: Timer.hpp: <pre><code>#ifndef TIME_HPP_INCLUDED #define TIME_HPP_INCLUDED #include <chrono> #if defined _WIN32 || defined _WIN64 #include <windows.h> template<typename T> class Timer { private: typedef T duration; typedef typename T::rep rep; typedef typename T::period period; typedef std::chrono::time_point<Timer, duration> time_point; std::chrono::time_point<Timer, duration> Time; static const bool is_steady = true; const rep g_Frequency = []() -> rep { LARGE_INTEGER frequency; QueryPerformanceFrequency(&frequency); return frequency.QuadPart; }(); inline std::chrono::time_point<Timer, duration> now() { LARGE_INTEGER count; QueryPerformanceCounter(&count); return time_point(duration(count.QuadPart * static_cast<rep>(period::den) / g_Frequency)); } public: inline void Start() {this->Time = this->now();} inline rep End() {return std::chrono::duration_cast<T>(this->now() - this->Time).count();} }; #else template<typename T> class Timer { private: static const bool is_steady = true; std::chrono::high_resolution_clock Clock; std::chrono::time_point<std::chrono::high_resolution_clock> Time; public: inline void Start() {this->Time = this->Clock.now();} inline T::rep End() {return std::chrono::duration_cast<T>(this->Clock.now() - this->Time).count();} }; #endif #endif // TIME_HPP_INCLUDED </code></pre> And my benchmark is as follows: <pre><code>int main() { Timer<std::chrono::nanoseconds> T; T.Start(); for (int i = 0; i < 100; ++i) { int** T2DArray = new int*[10000]; for (int i = 0; i < 10000; ++i) { T2DArray[i] = new int[10000]; } for (int i = 0; i < 10000; ++i) { delete[] T2DArray[i]; } delete[] T2DArray; } std::cout<<T.End()<<" us\n\n"; T.Start(); for (int i = 0; i < 100; ++i) { DimArray<int> TwoDArray(10000, 10000); } std::cout<<T.End()<<" us\n\n"; } </code></pre> The results it printed were: <pre><code>4.9599256 seconds //for int** 42.9303941 seconds //for DimArray<int> </code></pre> That's a huge difference! I cannot figure out why?! So I changed it to: <pre><code>int main() { Timer<std::chrono::nanoseconds> T; T.Start(); for (int i = 0; i < 100; ++i) { int** T2DArray = new int*[10000]; for (int i = 0; i < 10000; ++i) { T2DArray[i] = new int[10000]; } for (int i = 0; i < 10000; ++i) { delete[] T2DArray[i]; } delete[] T2DArray; } std::cout<<T.End()<<" us\n\n"; T.Start(); for (int i = 0; i < 100; ++i) { int* TwoDArray = new int[10000 * 10000]; delete[] TwoDArray; } std::cout<<T.End()<<" us\n\n"; } </code></pre> and the results were: <pre><code>4.6085725 seconds //for int** 0.1142958 seconds //for int* </code></pre> Any ideas why my class that uses <code>std::vector</code> is so slow compared to using a raw pointer?
Tags
<c++><memory-management><c++11><benchmarking>
Title
Class using std::vector allocation slower than pointer allocation by a LOT
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USBrandon
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POClass using std::vector allocation slower than pointer allocation by a LOT
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. COYou ran it in the release mode, right?
 singulars
 PostPostId
 POClass using std::vector allocation slower than pointer allocation by a LOT
 UserUserId
 USuser2345215
2. COYes sir. Not in debug mode. G++ 4.8.1 on Windows 8.1, 2.5ghz processor, 8gb ram.
 singulars
 PostPostId
 POClass using std::vector allocation slower than pointer allocation by a LOT
 UserUserId
 USBrandon

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.